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/7ika must remain a high priority 


Although the evidence suggests that the Olympic Games are safe to proceed, the global health 
community must not let the Zika virus fade from the research agenda until the threat is wiped out. 


virus? Or, more importantly perhaps, are the competitors and 

spectators? Athletes such as US women’s soccer goalkeeper 
Hope Solo have said they will go “begrudgingly’, and will barely leave 
their hotel rooms. Some have called for the event to be cancelled. 

This overestimates the risk of Zika to visitors. Although public- 
health agencies have advised pregnant women to avoid countries with 
active Zika transmission owing to the threat of birth defects, there are 
much more pertinent threats to the average visitor to Brazil, including 
dengue virus and random street violence. Much remains unknown 
about Zika, but a great deal is known, too, and it suggests that there is 
no reason to cancel the event. 

That we know so much so soon is a victory. When the US Centers 
for Disease Control and Prevention (CDC) concluded last month that 
the Zika virus causes birth defects, that marked the conclusion of one 
of the quickest-ever basic-science investigations into a crucial public- 
health issue. Some researchers expected it to take years. In the end, it 
took just six months. 

The CDC cited a growing number of studies using a range of 
approaches, including epidemiological, molecular and pathological, 
that showed a link between Zika and birth defects. The wisdom of 
making such a declaration so quickly remains a matter of some debate, 
but it is noteworthy that the global scientific community was able to 
organize itself in a relatively rapid fashion. There are three main rea- 
sons why this happened, and they offer lessons for research in future 
outbreaks. 

First, Brazil has a substantial amount of research infrastructure. 
Unlike some regions where outbreaks occur, such as nations in Africa, 
Brazil has made sizeable investment in research and public health. It 
is the only country in Latin America that invests more than 1% of its 
gross domestic product into research, and the country is well sup- 
ported by foreign research institutes, such as the US National Institutes 
of Health (NIH). The country spent US$31.9 billion on research in 
2013, for instance. 

This meant that when the Zika outbreak arose, there was a highly 
trained group of researchers ready to investigate the issue. The 
Oswaldo Cruz Foundation, a federally funded network of research 
stations across Brazil, has led investigations of the epidemiology and 
molecular aspects of Zika. And a long-running NIH-funded project 
on urban slum health in northeastern Brazil quickly adapted to set 
up large cohort studies to look at the possible link between Zika and 
birth defects. 

Second, there have been significant advances in neuroscience. 
Some of the most telling evidence for Zika's effects on human brain 
cells has come from studies of human neural stem cells and orga- 
noids — brain-like structures that can be grown in cell culture in the 
lab. Brain organoids were developed only a few years ago. By using 
organoids, human stem cells and human fetal tissue, researchers were 


A re this summer's Olympic Games under threat from the Zika 


quickly able to show that Zika preferentially targets and kills human 
neural precursors in the brain, offering a plausible explanation for a 
mechanism through which the virus causes birth defects. 

Third, the global health community has acknowledged that its dis- 
mally slow response to the Ebola epidemic let it grow out of control. 
Health officials were therefore determined to move more quickly in 
response to Zika. The World Health Organization declared the out- 
P break an emergency in February, and in the 

There same month US President Barack Obama 
should be no requested $1.9 billion for Zika-related activ- 
complacency.” ities. Although the US Congress has yet to 
approve that spending, the CDC said on 
13 May that it had found $85 million to distribute to states to help 
them prepare for the virus. 

There should be no complacency. The quick connection of dots 
between Zika and birth defects is only the beginning of the necessary 
mobilization. And we still don’t know why only some babies succumb 
to the devastating effects of Zika, or the most effective way to care for 
them. The questions will keep coming, and the research community and 
funders must continue to pursue them — with speed, strength and the 
highest urgency, as the organizers of the Olympics might say. m 


Second thoughts 


Revisting the past can help to inform ideas of 
the present. 


thoughts are deemed more noble than others. Darwin and Ein- 

stein could let their minds wander and imagine the consequences 
of certain actions or natural laws. But scientists and historians who try 
to estimate what might have happened if, say, Darwin had fallen off the 
Beagle and drowned, are often accused of playing parlour games. 

Most of these counterfactual thought experiments tend to focus on 
changes to the lives of historical figures — what would have happened 
had Hitler never been born, for instance. Dismissed as silly and specu- 
lative, such exercises are considered of little academic value, because 
the results of the experiment tend to align with what the experimenter 
would have wanted to happen. (One of first such published accounts 
seems to prove the point: Napoléon Apocryphe, published by a sup- 
porter in 1841, starts with the emperor surviving the 1812 Russian 
winter in an unburnt Moscow, conquering Europe, then Asia, Africa 
and the Americas, while discovering a new planet and inventing a fly- 
ing car on the way.) 


‘kK thought experiment has a noble place in research, but some 
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The course of history is surely contingent on the roles of influential 
individuals, which is why counterfactual tales of thwarted plots suc- 
ceeding and dictators killed as infants seem so poorly anchored to 
reality. An individual really can steer events, so a world without that 
individual is unknowable. But what about the course of science, and 
the ideas that push it along? Is scientific and technical progress equally 
contingent on circumstance and personality? Or are discoveries inevi- 
table, and independent of the people who happen to be around to 
make them? 

What if Darwin had toppled overboard before he joined the evolu- 
tionary dots? That discussion seems useful, because it raises interest- 
ing questions about the state of knowledge, then and now, and howit is 
communicated and portrayed. In his 2013 book Darwin Deleted — in 
which the young Charles is, indeed, lost in a storm — the historian 
Peter Bowler argued that the theory of evolution would have emerged 
just so, but with the pieces perhaps placed in a different order, and 
therefore less antagonistic to religious society. 

On page 293, another historian offers an alternative pathway for 
science: what if the ideas of Gregor Mendel on the inheritance of traits 
had been challenged more robustly and more successfully by a rival 
interpretation by the scientist W. E R. Weldon? Gregory Radick argues 
that a twentieth-century genetics driven more by Weldon’s emphasis 
on environmental context would have weakened the dominance of 
the current misleading impression that nature always trumps nurture. 

Unusually for a historian, Radick has some experimental data to 


draw from. Over a term, students at the University of Leeds, UK, 
where Radick teaches, were presented with a curriculum that relegated 
Mendel to the margins and promoted Weldon in his place. The result, 
perhaps not surprisingly, was a class of undergraduates who were less 

willing to see genetics as immutable destiny. 
“Science without _ Biologists may take issue with the methods, 
consensus would _ but the results seem less important than the 
be chaos.” fact that such an experiment could be per- 

formed at all. If the past is a foreign country, 
then it is also supposed to be one that cannot be revisited. With a little 
imagination, what other thought experiments could be tested in this 
way? The history of science is, after all, littered with major theories that 
became scientific orthodoxy but initially attracted a great deal of tough- 
minded criticism, from the Newtonian theory of universal gravitation 
to the Copenhagen interpretation of quantum mechanics. 

These ‘winners’ became dominant before all the criticisms against 
them were fully answered, which raises questions about why the 
debates went the way they did, and whether they could have gone 
otherwise — and if so, with what repercussions. 

A well-informed interest in alternative scientific pasts can help us to 
take the actual past more seriously as a source of present-day insight. It 
can also help us to stay self-critical as we make choices in the present. 
Science without consensus would be chaos. But the price of consensus 
is eternal vigilance against complacency, and a willingness to contem- 
plate the road otherwise not travelled. = 


Open medicine 


Governments need to tighten regulation if the 
sharing of clinical-trial data is to succeed. 


set protocols to test medicines and to treat patients, no such 
standards exist to compare clinical-trial data. 

The problem arises because each research group has a preferred 
method of collecting and categorizing results. Differences can be as great 
as omitting or including the gender and ethnicity of patients enrolled, or 
as mundane as the vocabulary used in medical records. For example, a 
study published in The BMJ this week challenged the ‘weekend effect’ — 
the idea that people in the United Kingdom admitted to National Health 
Service hospitals at weekends are more likely to die compared with those 
admitted on weekdays — by saying that stroke patients’ conditions are 
frequently miscoded by UK hospital staff (L. Li et al. Br. Med. J. 353, 
i2648; 2016). If proved, such a finding could cast doubt on research 
studies that use medical records as a data source. 

The lack of a single place to report descriptive metadata compounds 
the problem. Data sharing is most challenging for developing coun- 
tries, which often lack the resources for large-scale statistical analysis 
(L. Merson et al. N. Engl. J. Med. http://doi.org/bhmb; 2016). 

The latest attempt to address this problem is Vivli, described last 
week in The New England Journal of Medicine as a universal platform 
to “link existing data-sharing platforms and communities, while 
hosting data from investigators who aspire to share data but lack the 
resources to do so” (B. E. Bierer et al. N. Engl. J. Med. http://doi.org/ 
bhmc; 2016). 

The platform, to be run by the Multi-Regional Clinical Trials Center 
of Brigham and Women’s Hospital and Harvard in Cambridge, Mas- 
sachusetts, will initially employ curators to convert the clinical-trial 
data they receive from researchers into a standard, anonymized format 
and post it on the platform for other researchers to access on request. 
Eventually, the system's creators hope to switch to automated curation. 

Vivli’s creators acknowledge its limitations. It will no doubt be 


(sci science has a compatibility problem. Although there are 
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useful for researchers to verify findings, avoid duplicating trials and 
prioritize next steps. Similar efforts by scientific societies and research- 
ers on specific diseases have made progress in these directions. But 
data anonymization and standardization is time-consuming, and so 
only limited amounts of medical data will at first be available. 

Yet the best database in the world cannot address the underlying 
and pernicious deficiency in clinical-trial data sharing. Unless gov- 
ernment regulation is rapidly and substantially changed, Vivli will 
lack data from the majority of clinical trials — leaving it far from the 
comprehensive database that its creators envisaged. Despite US laws 
requiring trial data to be publicly disclosed in the Clinical Trials.gov 
database, lax enforcement, loopholes and ambiguous interpretation 
have allowed pharmaceutical companies to selectively report positive 
results, alter endpoints and avoid publishing data on time. 

Most trials disappear altogether. Some studies estimate that only 
one-third of trials for drugs approved by the US Food and Drug 
Administration (FDA) are ever published (J. S. Ross et al. PLoS 
Med. 6, €1000144; 2009). One assessment found that only two of 
the ten large pharmaceutical companies complied with FDA regu- 
lations on data reporting for drugs approved in 2012 (J. E. Miller 
et al. BMJ Open 5, e009758; 2015). Gilead and Sanofi Pharmaceuti- 
cals were the worst offenders according to author Jennifer Miller of 
Bioethics International. And private industry is not the only offender — 
about one-third of federally funded trials remain unpublished four years 
after completion (J. S. Ross et al. Br. Med. J. 344, d7292; 2012). 

Hidden clinical-trial data have long been an open secret in the field, 
but researchers are now starting to assess it more quantitatively. On 
17 May, the Laura and John Arnold Foundation awarded Bioethics 
International a US$3.6-million grant, which will allow the organization 
to assess trial-data publication, or lack of it, for all new FDA-approved 
drugs and biologics. 

Ideally, knowledge of such poor compliance will shock the 
public, pushing the FDA and other regulators into creating stronger 
incentives for companies to publish complete data, such as serious 
fines and penalties. Without such interven- 
tion, well-intentioned efforts such as Vivli 
are doomed to remain tragically incomplete, 
robbing researchers of the opportunity to truly 
link their work. = 
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WORLD VIEW accion 
iy "| Teach students the 


but also consider alternative scenarios. What might have 

happened, for example, if Britain had stayed out of the war 
in Europe in 1914? Science historians also ask such counterfactual 
questions, and the results can be surprisingly instructive. 

Take genetics. The past year has seen prolonged celebrations of 
the work of Gregor Mendel, linked to the 150th anniversary of the 
paper that reported his experiments with hybrid peas. Mendel’s 
experiments are central to biology curricula across the world. By 
contrast, the criticisms levelled at Mendel’s ideas by W. F. R. Weldon, 
Linacre professor at the University of Oxford, UK, are a footnote. 

From 1902, Weldon’s views brought him into increasingly bad- 
tempered conflict with Mendel’s followers. In basic terms, the Mendel- 
ians believed that inherited factors (later called 
‘genes ) determine the visible characters of an 
organism, whereas Weldon saw context — devel- 
opmental and environmental — as being just as 
important, with its influence making charac- 
ters variable in ways that Mendelians ignored. 
The Mendelians won — helped by Weldon’s 
sudden death in 1906, before he published his 
ideas fully — and the teaching of genetics has 
emphasized the primacy of the gene ever since. 

The problem is that the Mendelian ‘genes for’ 
approach is increasingly seen as out of step with 
twenty-first-century biology. If we are to real- 
ize the potential of the genomic age, critics say, 
we must find new concepts and language better 
matched to variable biological reality. This is 
important in education, where the reliance on 
simple examples may even promote an outmoded 
determinism about the power of genes. 

But what if Mendelism had never come to dominate genetics in the 
first place? What if Weldon’s perspective had emerged as the winner 
in that historical battle, and his interactionism, allied to his vivid sense 
of how variable the real characters of real organisms are (never just 
yellow or green, round or wrinkled, or any other Mendelian binary), 
had become the core of the subject? This is where I, and colleagues, 
have tried to run an experiment. 

In a recent two-year project, we taught university students a 
curriculum that was altered to reflect what genetics textbooks might 
be like now if biology circa 1906 had taken the Weldonian rather than 
the Mendelian route. These students encountered genetics as funda- 
mentally tied to development and environment. Genes were not 
presented to them as what inheritance is ‘really 


I ] istorians study the causes and consequences of past events, 
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78 biology of their time 


An experiment in genetics education reveals how Mendel’s legacy holds back 
the teaching of science, says Gregory Radick. 


levels and other influential factors, themselves often influenced by 
non-genetic factors (such as smoking). Where in this tangle, we ask 
them, is a gene for heart disease? In effect, this revised curriculum seeks 
to take what is peripheral in the existing teaching of genetics and make 
it central, and to make what is central peripheral. 

Our experimental group consisted of second-year humanities 
undergraduates. First-year biologists, who were taught the conven- 
tional approach, acted as our control. We saw a difference — those 
students taught the Weldon way emerged as less believing of genetic 
determinism, and, I suspect, better prepared to understand the subtle- 
ties of modern genetics. (The difference was statistically significant, 
but I hesitate to make much of that, given that numbers were small and 
there were differences between the groups. Iam mindful, too, that it was 
Weldon who first drew attention to Mendel’s own 
problems with exaggerated statistics.) 

With such experiments — bringing insights 
from the archive into the science classroom — 
the scientific past can inform and maybe even 
improve the scientific future. In turn, they sug- 
gest a broader vision of collaboration. To advance 
scientific knowledge, historians and philosophers 
of science should work in close proximity to sci- 
entists, not actually in the lab but right down the 
corridor. Then, investigations into neglected phe- 
nomena and debates that were shut down too soon 
might provide the spark to serve creative science. 

What of Mendel? Some might complain that 
it is a poor anniversary gift to jettison him from 
his place of honour in the genetics curriculum. 
Let me suggest that this grumbling, although 
understandable, is misguided. If we want to hon- 
our Mendel, then let us read him seriously, which is to say historically, 
without back-projecting the doctrinaire Mendelism that came later. 
Study Mendel, but let him be part of his time. 

Likewise, let our biology students be part of their time, by giving 
them a genetics curriculum fit for the twenty-first century. Ifwe teach 
them about Mendel, we should do so not to fill them with slack-jawed 
wonder at his foundational achievement, but to help them to appreciate 
how even the most imaginative and rigorous science — and Mendel's 
was first rate on both counts — bears the stamp of the historical cir- 
cumstances of its making. To learn that lesson about past science is to 
bring a welcome level of self-awareness and critical self-reflection to 
the present. = 


Gregory Radick is professor of history and philosophy of science at 
the University of Leeds, UK, and president of the British Society for the 
History of Science. This is an edited version of his presidential address, 
to be published in the British Journal for the History of Science. 
e-mail: g.m.radick@leeds.ac.uk 
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DIAGNOSTICS 


Fast Zika test 
developed 


A low-cost paper-based test can 
rapidly detect low levels of Zika 
virus in human blood samples. 

James Collins at the 
Massachusetts Institute of 
Technology in Cambridge 
and his colleagues previously 
designed a paper-based test 
for Ebola virus. Specially 
programmed RNA ‘sensors’ 
embedded in paper strips 
bind to viral RNA sequences, 
triggering a chemical reaction 
that changes the colour of 
the paper. The researchers 
adapted this for the Zika virus, 
and added a step that boosts 
the amount of viral RNA, 
increasing the test's sensitivity. 
They detected Zika virus in 
human serum spiked to mimic 
samples from infected patients. 
The test also distinguished Zika 
from the related dengue virus, 
and can differentiate between 
two similar strains of Zika. 

It takes about 5 days to 
design the RNA sensors and 
produce the paper strips, and 
roughly 3 hours to run the test, 
making the system potentially 
useful in future disease 
outbreaks, the authors say. 

Cell http://doi.org/bhijf (2016) 


Model predicts 
neutron-star signal 


Physicists have devised a fast, 
accurate model that recreates 
the gravitational-wave signals 
produced by spiralling and 
colliding neutron stars. The 
model could help researchers to 
work out the stars’ properties. 
The US Advanced Laser 
Interferometer Gravitational- 
Wave Observatory (LIGO) 
is looking for gravitational 
waves — ripples in space-time 
generated by cataclysmic 
cosmic events, such as 


Selections from the 
scientific literature 


Ancient Earth rocks emerge from lava 


Deep inside Earth are rocks that date back to the 
first 50 million years of the Solar System’ history. 
The discovery suggests that, even as internal 
heating mixes most of Earth’s interior, some 
primordial material still survives today. 

A team led by Hanika Rizo of the University of 
Quebec in Montreal, Canada, analysed lava that 


to form basalt rock (pictured). Both places are 
thought to tap deep and ancient sources of lava. 
The scientists found that levels of tungsten-182 
— an isotope created early in Earths history by 
the radioactive decay of hafnium-182 — were 
higher in the Canadian and Pacific rocks than in 
those from other parts of the world, confirming 


erupted off the coast of northern Canada and 
the southwest Pacific Ocean, and then cooled 


collisions of stars and other 
bodies. To learn more about 
neutron stars from these 
signals, researchers need a wide 
range of simulated examples 
of collisions for comparison. 
Creating a full simulation can 
take several weeks, so Tanja 
Hinderer at the University 
of Maryland in College Park 
and her colleagues produced a 
simplified model. By showing 
in detail how the stars deform 
under each other's strong 
gravitational pull, the model 
predicts waveforms that agree 
better with full simulations 
than do previous such models. 
The authors plan to use their 
model to create thousands of 
different possible waveforms 
that will help researchers to 
interpret LIGO data. 
Phys. Rev. Lett. 116, 181101 
(2016) 
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GENETICS 


Genes linked to 
different traits 


Seemingly unrelated 
biological traits, such as the 
risk of schizophrenia and of 
inflammatory bowel disease, 
can be linked to the same gene 
variants. 

One version of a gene can 
affect multiple traits, but 
identifying whether a specific 
gene variant affects two 
unrelated pathways has been 
difficult. Joseph Pickrell of the 
New York Genome Center and 
his team analysed data from 
43 genome-wide association 
studies (GWAS) that correlated 
gene variants with traits 
such as height, nose size and 
neurological disorders. They 
found several clusters of traits 
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that the rocks contained ancient materials. 
Science 352, 809-812 (2016) 


that shared common genetic 
roots, such as one associated 
with lower body-mass index, 
increased height and reduced 
risk of male pattern baldness. 
Considering data from 
many traits is important when 
studying the effect of a genetic 
variant, the authors say. 
Nature Genet. http://dx.doi. 
org/10.1038/ng.3570 (2016) 


ASTRONOMY 


Black hole weighed 
with precision 


Astronomers have made 
precise measurements of the 
mass of a supermassive black 
hole. 

Aaron Barth of the 
University of California 
in Irvine focused the 
Atacama Large Millimeter/ 
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submillimeter Array (ALMA) 
in Chile on the black hole at the 
heart of NGC 1332, a galaxy 
that is roughly 22 million 
parsecs (73 million light 
years) away. ALMAs high- 
resolution images revealed 
carbon monoxide gas orbiting 
just outside the black hole. By 
measuring the velocity of the 
gas, the team found the black 
hole to be 664 million times 
the mass of the Sun, with an 
uncertainty of roughly 10%. 
The technique improves 
on less-direct measurement 
methods. 
Astrophys. J. Lett. in the press; 
preprint at http://arxiv.org/ 
abs/1605.01346 (2016) 


EVOLUTION 


Commerce kept 
camel DNA diverse 


Ancient caravans travelling 
through the deserts of the 
Middle East and Africa over 
the past several thousand 
years have shaped the genetic 
diversity of domestic camels. 

To learn about camels’ 
evolutionary history, a team 
led by Pamela Burger at the 
University of Veterinary 
Medicine in Vienna and 
Olivier Hanotte at the 
University of Nottingham, 
UK, analysed DNA from 
the remains of 15 ancient 
dromedaries and from more 
than 1,000 living ones from 
21 countries. They found that 
domestic camels (Camelus 
dromedarius; pictured) 
are most closely related to 
wild ones now living in the 
southeast Arabian Peninsula, 
and that their DNA varies little 
by geography — a possible 
relic of ancient trading routes 
that mixed the genes of distant 
populations. 

Unlike some other 
domesticated 
livestock, 
camels seem 
to have largely 
maintained 
their ancestral 
genetic diversity. 
Proc. Natl Acad. 
Sci. USA http:// 
doi.org/bhf7 
(2016) 


Self-help for 
type 1 diabetes 


Cells from people with type 1 
diabetes can be reprogrammed 
so that they produce insulin 

in response to glucose when 
implanted into mice. This 
suggests that patients with 

the disease could one day 

be treated with their own 
reprogrammed cells. 

In previous work, Douglas 
Melton at Harvard University 
in Cambridge, Massachusetts, 
and his team took connective- 
tissue cells from healthy 
donors, reprogrammed them 
into stem cells and then grew 
B-cells, which make insulin 
in the pancreas. Now the 
team has done this using 
connective-tissue cells from 
people with type 1 diabetes. 
The B-cells derived from 
these patients were similar to 
those derived from healthy 
individuals — they prevented 
diabetes in mice that had 
lost their own B-cells, and 
responded to anti-diabetes 
drugs. 

Nature Commun. 7, 11463 (2016) 


‘Nanoblade’ slips 
organelle into cells 


A laser-based technique 
permits the delivery of 
energy-generating organelles 
called mitochondria into 
single mammalian cells, where 
they can restore metabolic 
activities. 
Mitochondria have 
their own DNA, which can 
cause disease if mutated, 
and researchers have been 
looking for ways to isolate 
the organelles, correct 
genetic defects and return the 
mitochondria to cells. Pei- Yu 
Chiou and Michael 
Teitell at the 
University of 
_ California, Los 
. Angeles, and 
their colleagues 
coated a 
micropipette 
tip with a 
100-nanometre-thick film 
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Popular topics 


SOCIAL SELECTION 


Epigenetics article kicks up astorm 


on social media 


A story in the 2 May issue of The New Yorker has been criticized 
for inaccurately describing how genes are regulated. The article 
by Siddhartha Mukherjee — a physician, cancer researcher and 
award-winning author at Columbia University in New York 
— examines how environmental factors change gene activity 
without altering DNA sequence. Jerry Coyne, an evolutionary 
ecologist at the University of Chicago in Illinois, wrote two 
blog posts calling the piece “superficial and misleading’, 
largely because it ignored key aspects of gene regulation. Other 
researchers quoted in the blog called it “horribly damaging” 
and “a truly painful read”. Mukherjee responded with a point- 
by-point rebuttal online. Speaking to 


> NATURE.COM Nature, he says he now realizes that he 
For more on erred by omitting key areas of science, but 
popular papers: that he didn't mean to mislead. “I thought 


go.nature.com/rnoh6u that I had done it justice, he says. 


of light-absorbing titanium 

and lightly touched it to a 

cell membrane. A laser pulse 
heated the tip, causing a bubble 
to quickly form and collapse in 
the cell’s culture medium. The 
bubble's expansion punctured 
the membrane, creating an 
opening large enough for the 
delivery of mitochondria. 

The technique’s 2% 
efficiency rate is higher than 
that of other methods, and 
the team is now working to 
increase throughput. 

Cell Metab. 23, 921-929 (2016) 


ECOLOGY 


Wolf cull makes 
poaching worse 


Government-approved killing 
of wolves (pictured) increases 
illegal hunting in parts of the 
United States. 
State-sanctioned culls are 
thought to be an effective 
conservation tool for 
reducing poaching of large 
carnivores. Guillaume 
Chapron at the Swedish 
University of Agricultural 


experienced periodic stages of 
protection and legal culling. 
The authors found that 
population growth slowed 
during periods of culling, 
regardless of the number of 
wolves culled. 

The authors ruled out 
other explanations for the 
decline, and suggest that 
culling increases people's 
intolerance of endangered 
predators, leading to more 
poaching. They recommend 
a moratorium on culling 
until its effects are better 


Sciences in Riddarhyttan understood. 

and Adrian Treves at the Proc. R. Soc. B 283, 20152939 
University of Wisconsin- (2016) 

Madison studied wolf 

populations in Michigan and > NATURE.COM 

Wisconsin between 1995 and For the latest research published by 
2012. During that time, policy Naturevisit: 


shifts meant that wolves WWwwW.nature.com/latestresearch 
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Methane rules 


The US Environmental 
Protection Agency finalized 
a trio of regulations on 

12 May that aim to reduce 
emissions of methane and 
other pollutants from oil 
and gas fields. Methane is 
25 times more powerful than 
carbon dioxide at trapping 
heat in the atmosphere, and 
the agency projects that 

the new regulations will 
reduce methane emissions 
by the equivalent of 

11 million tonnes of CO, in 
2025. The regulations apply 
only to new and modified 
operations, but the agency 
says that it is planning a 
separate regulatory process 
for existing ones. 


Costly climate 


The annual cost of adapting 
to a warming world for 
developing countries 

could reach US$280 billion 

to $500 billion by 2050, 
according to the United 
Nations Environment 
Programme's second 
Adaptation Gap Report, 
released on 10 May. Countries 
and multilateral institutions 
have increased adaptation aid, 
which totalled $22.5 billion 

in 2014, according to the 
report, but adaptation costs 
are already two or three times 
higher than that. The agency 
also called for a system to 
track investments and assess 
progress in meeting goals. 


New worlds galore 
NASAs Kepler space 
telescope has in one stroke 
more than doubled the list 

of planets it has discovered 
around other stars, thanks to 
a new method of validating 
signals of possible worlds. On 
10 May, scientists confirmed 
the existence of 1,284 new 


Porpoise on brink of extinction 


The vaquita (Phocoena sinus), a porpoise 
unique to the Gulf of California off Mexico, 
could be extinct by 2022, according to a report 
released on 14 May by the International 
Committee for the Recovery of the Vaquita 
(CIRVA). On the basis of acoustic data from 
submarine recorders and a survey of the 
vaquita's entire range, CIRVA estimates that 
there are now fewer than 60 animals left. Its 


extrasolar planets from a 
pool of 4,302 candidates 

that Kepler has spotted since 
2009. They include about 

100 roughly Earth-sized 
bodies and nine planets in the 
habitable zones of their stars. 
The announcement raises the 
total number of confirmed 
extrasolar planets to 3,264. 
See go.nature.com/mxwalg 
for more. 


Poor air quality 
The vast majority of city 
dwellers in poorer countries 
endure unsafe levels of air 
pollution, according to an 
updated database released 
by the World Health 
Organization (WHO) on 
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12 May. In low- and middle- 
income countries, 98% of 
people in large cities that 
monitor outdoor pollution 
breathe air that exceeds 

the limits set by the WHO, 
according to the database, 
which now covers 3,000 towns 
and cities in 103 countries. 
Among megacities with more 
than 14 million people, Delhi, 
Cairo and Dhaka experienced 
the worst levels of pollution 
from fine particles, which 
increase the risk of stroke, 
heart disease, lung cancer 
and asthma. Globally, urban 
air pollution worsened by 

8% between 2008 and 2013, 
despite improvements in 
Europe and the Americas. 
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dramatic decline of 92% in just under two 
decades is attributed to the use of gill nets 

in fishing. Mexico implemented a two-year 
gill-net ban in 2015, but some fishermen have 
continued to use the nets, resulting in the 
deaths of at least three vaquitas this year. Ata 
meeting in Ensenada, Mexico, last week, CIRVA 
scientists called on the Mexican government to 
make the gill-net ban permanent. 


Cosmic funding 

The Simons Foundation in 
New York will contribute the 
majority of a US$40-million 
donation to establish an 
observatory to look for 
primordial gravitational 
waves, the non-profit 
organization announced 

on 12 May. The Simons 
Observatory will scour the 
Universe's all-pervading 
background radiation for 
imprints of the waves. Finding 
them would provide evidence 
of a theorized period of rapid 
inflation in the moments after 
the Big Bang. (The BICEP2 
experiment made a claim to 
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have discovered this imprint 
in 2014, but subsequently 
retracted it.) The telescopes 
will be built in Chile's 
Atacama Desert and aim to 
be around ten times more 
sensitive than any currently 
operating facility. 


Funding rejig 

Britain's seven research 
councils are to be 

brought together into one 
‘supercouncil, according to 

a government White Paper 
unveiled on 16 May. The 
overarching body will be called 
UK Research and Innovation, 
and will also include the 
agency Innovate UK and the 
research-funding portions of 
the Higher Education Funding 
Council for England. Under 
the plan, the agencies will 
retain their own identities 

and separate budgets. The 
plan also calls for the creation 
of an Office for Students to 
oversee higher education, 

and introduces the Teaching 
Excellence Framework, which 
will evaluate teaching at UK 
universities and allocate 
funding on the basis of quality. 


Iikka Hanski dies 


Researchers around the 
world have been paying 
tribute to Finnish ecologist 
Ilkka Hanski (pictured), 
who died on 10 May, aged 
63. Hanski won the Crafoord 


Prize in Biosciences from the 


Royal Swedish Academy of 
Sciences in 2011 for his work 
on how spatial variation affects 
animal and plant populations. 
In 1991, he founded the 
Metapopulation Research 
Centre at the University of 
Helsinki to study species in 
fragmented landscapes. 


Academy head quits 


Biochemist Wong Chi-huey 
resigned on 10 May as the 
head of Academia Sinica, 
Taiwan's foremost research 
institution. After initially 
rejecting Wong's resignation 
in March, Taiwan's President 
Ma Ying-jeou accepted it 
following a controversy over 
Wong’s relationship with 
Taiwanese biotechnology 
company OBI Pharma. 
Wong's daughter reportedly 
sold shares in OBI before 
their value dropped following 
the release in February of 
disappointing results of a 
trial for a breast-cancer drug. 


SOBERING OUTLOOK 


Wong denied any allegations 
of insider trading, saying that 
he had no advance knowledge 
of the trial results. 


| BUSINESS 
Pharma deal 


Pharmaceutical giant 

Pfizer announced a 
US$5.2-billion deal on 

16 May to acquire Anacor 
Pharmaceuticals of Palo Alto, 
California, which specializes 
in boron-based compounds. 
Pfizer cited crisaborole, an 
experimental drug for the 
treatment of eczema that is 
under review for approval 
by the US Food and Drug 
Administration, as Anacor’s 
“flagship asset”. In April, 
Pfizer scrapped plans for a 
$160-billion merger with 
Dublin-based Allergan after 
the US treasury tightened up 
rules on companies moving 
their headquarters abroad to 
cut their tax bills. 


Execution drugs 


Drug multinational Pfizer 
will no longer sell drugs for 
use in executions by lethal 
injection, the company 
announced on 13 May. In 
banning the distribution 

of 7 drugs to correctional 
facilities, Pfizer joins 24 other 
companies that have ended 
such sales. “Pfizer makes its 
products to enhance and save 
the lives of the patients we 
serve, the company said ina 


Carbon dioxide emissions from fossil fuels will continue to rise for 


If current policies and regulations 
remain, energy-related carbon eee ee a 
dioxide emissions may rise = Coal 

xe : 16... == Liquids 
by 34% to 43 billion tonnes in = Aurel eas 
2040. In its International Energy 
Outlook 2016, released on 11 May, 
the US Energy Information 
Administration (EIA) projects 
that energy consumption will 
rise by 48% by 2040, spurred 
by growing demand in Asia. 
Although renewable energy is 
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fossil fuels are likely to continue to - 
supply more than three-quarters 1990 000 5010 


of the world’s energy, the EIA said. 


decades unless climate goals are pursued with a vengeance. 
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23-24 MAY 

The biotechnology 
conference Big Data 
Leaders Europe takes 
place in London. 
go.nature.com/uchnje 


24-26 MAY 

The Thirty Meter 
Telescope Science 
Forum, the facility's 
annual planning 
meeting, gathers in 
Kyoto, Japan. 
go.nature.com/sbnhua 


24 MAY 

The 13th and 14th 
satellites of Europe's 
Galileo navigation 
system are due to launch 
from the Guiana Space 
Centres 
go.nature.com/kqdwgk 


statement. All the makers of 
such drugs that are approved 
by the US Food and Drug 
Administration have now 
halted sales for executions, 
according to the London- 
based group Reprieve, which 
campaigns against the death 
penalty. 


Preprint purchase 
The Social Science Research 
Network (SSRN), one of 
the world’s most popular 
repositories of preprint 
research in economics, law 
and the social sciences, has 
been bought by academic- 
publishing giant Elsevier 
for an undisclosed sum. 
After the acquisition, which 
was announced on 17 May, 
the SSRN will continue to 
offer free submissions and 
downloads, said its chief 
executive, Gregg Gordon. 
Social Science Electronic 
Publishing, a private firm 

in Rochester, New York, 
founded the SSRN in 1994. 
See go.nature.com/5jnxqv for 
more. 
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A farmer works in southern Italy, where the bacterium Xylella fastidiosa pauca is devastating olive trees. 


PLANT PATHOGENS 


Olive tree gridlock eases 


Court ruling paves way for containment of deadly plant disease, but risk of spread remains. 


BY ALISON ABBOTT, BARI, ITALY 


southern Italy, plant pathologist Donato 
Boscia gestures towards a landscape of 
the dullest brown — dead and dying olive trees 
as far as the eye can see. Six months earlier, he 
says, that canopy was mostly green, with just a 
few tell-tale brown spots marking the relentless 
advance of a vicious pathogen, Xylella fastidiosa, 
that was previously unknown in Europe. 
Almost three years ago, Boscia, who heads 
the Bari unit of the CNR Institute for Sustain- 
able Plant Protection (IPSP), and other col- 
leagues identified a subspecies of the bacterial 
pathogen, Xylella fastidiosa pauca, as the cause 
of olive quick decline syndrome (OQDS) in 
Puglia. They said that it probably arrived with 


Be a small hill in the Puglia region of 


an ornamental plant imported from the Amer- 
icas, where some Xylella species are endemic. 

But an alarming political and legal impasse 
has stopped measures to contain the pathogen, 
which has invaded nearly 200,000 hectares of 
olive groves and is killing most olive trees in its 
wake — including beloved specimens that are 
more than 1,000 years old. 

Now the gridlock shows some signs of easing. 
On 12 May, the European Court of Justice 
declared that containment measures agreed by 
the European Union more than a year ago — but 
that were quickly blocked by dismayed protest- 
ers — were indeed appropriate, paving the way 
for them to now go ahead. These include moni- 
toring the disease’s spread, uprooting infected 
trees and, in some cases, any apparently healthy 
trees surrounding them, and killing the insects 
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that carry the bacteria. There is currently no 
cure for Xylella, and evidence from the Ameri- 
cas indicate that it cannot be eradicated. 

Still, challenges to fighting the outbreak 
remain. The infected area has grown over the 
past year, increasing the chance that the disease 
will spread to other countries in the Mediter- 
ranean basin, which accounts for 95% of the 
world’s olive production (see ‘Olive geography ). 

In February last year, Italian authorities 
began implementing the EU-agreed contain- 
ment plan. But protesters, with local political 
support, dismissed the evidence that OQDS 
was caused by Xylella, and that it cannot be 
cured or eradicated. They argued that the 
measures were not warranted and took their 
case to a local administrative court, which 
referred it to the European Court of Justice. 
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MSS IN FOCUS 


OLIVE GEOGRAPHY 


The world’s top ten olive-producing countries are 
situated in or around the Mediterranean basin, 
raising fears about the spread of an olive-tree 
infection that is ravaging southern Italy. 


e = 100,000 tonnes of olives produced 


> Paralysed by uncertainty, local authorities 
blocked even monitoring efforts. 

Aside from their economic importance, 
olive trees have a long historical and cultural 
connection with these regions. “So the public 
resistance is understandable,” says plant geneti- 
cist Francesco Salamini, who is leading a small 
group of scientists preparing an independent 
report on the affair for Italy’s national academy. 

Confusing perceptions further, last Decem- 
ber public prosecutors opened an investigation 
into whether five scientists — Boscia included 
— and five public officials negligently spread the 
disease, deceived officials about it and caused 
environmental pollution, among other things. 
IPSP director Gian Paolo Accotto, who is based 
in Turin, calls these notions “logically absurd”. 


CONTROL PLAN 

The case against containment began to unravel 
about eight weeks ago when the Parma-based 
European Food Safety Authority (EFSA), 
which provides independent scientific advice 
to the EU, released three reports’ * supporting 
the role of the bacteria in causing the disease 
and endorsing the EU-agreed containment 
measures. Puglia regional authorities then 
approved a new containment plan that defines 
a greater area of the region as infected (see 
inset in ‘Olive geography’). Early this month, 
administrators there agreed to restart monitor- 
ing. A few days later, the European Court of 
Justice made its declaration. 


‘) | MORE NEWS | 
Magic- @ LHC for quasiparticles reveals 
MORE mushroom material secrets go.nature.com/84ebte 
drug lifts @ Digital badges motivate scientists to 
ONLINE depression in share data go.nature.com/hyjbnm 
first human @ Underwater archaeologists unearth 
trial go.nature. ancient butchering site go.nature.com/ 
com/isijvz feazly 
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2015 border of 
containment area 


Adriatic 
Sea 


2016 border of 
containment area 


Data from 2013. 


The need for containment is highlighted by 
Boscia’s work over the past year inoculating 
different plant species with X. f pauca in mod- 
est greenhouses at the IPSP in Bari. This has 
revealed that citrus trees and grape vines are 
immune to the pathogen, but that it can infest a 
large number of local species such as lavender, 
oleander and polygala. Researchers say that 
even if all infected olive trees were destroyed, 
the bacteria would be so widely harboured in 
the environment that containment is the best 
measure. 


The Bari experi- “You can stop 
ments also revealed humans from 
that some cultivarsof trying to help, 
olive tree — includ- but youcan’t 
ing Leccino, themain stop bacteria, or 
cultivar in the Tus- put theminjail.” 


cany region — devel- 

oped milder symptoms of disease than do the 
predominant cultivars of Puglia. This could 
inform efforts to deal with the pathogen should 
it spread to other regions. It also helps replant- 
ing schemes, although more seasons of growth 
are required to see whether the Leccino trees 
survive long-term, says Boscia. 

Other experiments at the IPSP are analysing 
gene expression in infected trees to determine 
which molecular characteristics might offer 
some resilience. This could one day inspire 
treatments for infected trees. 

The Bari scientists, together with a group of 
worried olive producers, are also carrying out 
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experiments in the infected zone. Enzo Manni, 
head of an association that represents 600 small 
olive groves there, says that he was stonewalled 
a few years ago when he tried to interest local 
institutions in what seemed to be a threaten- 
ing new disease. “But I knew something was 
really wrong,’ he says. Now he helps the Bari 
researchers by identifying land for outdoor 
experiments and monitoring the tests. 


TOLERANCE TEST 

In one of these experiments, the team of scien- 
tists and producers — which has now grown 
to 40 — has planted different cultivars to test 
whether any show tolerance over many sea- 
sons. Such experiments could guide which 
trees get planted in the region, or elsewhere. 
In another study, the scientists are grafting dif- 
ferent cultivars onto newly infected trees to see 
whether some survive, thereby offering hope 
of recreating canopies on diseased root stock. 
In some cases, they are funding some of these 
experiments themselves so as not to lose time. 

Accotto applauds the scientists’ commitment 
in spite of the police investigations. “You can 
stop humans from trying to help, but you can't 
stop bacteria, or put them in jail,’ he says. 

Fearing the spread of Xylella, scientists in 
other countries are looking to join in. Those 
from Spain, the world’s largest producer of 
olives, have added a couple of the country’s cul- 
tivars to the experiments to get an idea of how 
fast their trees might succumb to the disease. 

The United Nations Food and Agriculture 
Organization (FAO), based in Rome, organ- 
ized an April workshop in Bari to share infor- 
mation with scientists and agriculture officials 
from olive-growing countries in Europe, North 
Africa and the Middle East. 

The delay to the containment programme in 
Italy has probably increased the risk of OQDS 
spreading abroad, says Shoki Al-Dobai, head of 
the FAO’s North Africa and Middle East office 
in Cairo. He is organizing a contingency plan 
for that area. Syria and Libya, where olive cul- 
tivation is believed to have begun in 2,500 Bc, 
are of particular concern, he says, because wars 
in those countries make appropriate monitor- 
ing impossible. m 


1. Saponari, M. et al. Pilot Project on Xylella fastidiosa 
to Reduce Risk Assessment Uncertainties (EFSA, 
2016); available at http://go.nature.com/aldezo 

2. EFSA Panel on Plant Health. EFSA J. 14, 4450 (2016). 

3. EFSA Panel on Plant Health. EFSA J. 14, 4456 (2016). 
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Brazil’s science woes mount 


Demotion of science ministry rubs salt in wound following severe federal funding cuts. 


BY CLAUDIO ANGELO 


razil’s scientists, already struggling 
B to absorb massive funding cuts, are 

protesting against another blow: the 
country’s science ministry has been demoted 
by interim president Michel Temer, who took 
over the government on 12 May after a Sen- 
ate impeachment vote ousted Dilma Rousseff 
from the presidency. 

One of Temer’s first actions was to announce 
the fusion of the federal Ministry of Science, 
Technology and Innovation (MCTI) with the 
ministry that deals with telecommunications 
and Internet regulation. Science is now an 
office within a ‘superministry led by Gilberto 
Kassab, a former mayor of Sao Paulo. 

The move, which Temer had begun to hint 
at a few days before, has angered some Bra- 
zilian researchers. On 11 May, 13 scientific 
associations, led by the Brazilian Society for 
the Advancement of Science and the Brazilian 
Academy of Sciences (ABC), sent a letter to 
Temer warning that the fusion would be “det- 
rimental to the country’s scientific and tech- 
nological development”. Researchers say that it 
would erode the authority of the MCTI, which 
has formed the backbone of federal support for 
science and innovation in Brazil for the past 
three decades. 

“An administrative reorganization should 
stem from a vision for the country. It shouldn't 
simply be an artificial lumping of disparate 
activities,’ says ABC president Luiz Davidovich. 
The move is “a step backwards’; he says. 

Temer had first caused concern among 
scientists the week before, when he hinted to 
local media that he might appoint a creationist 
evangelical bishop to lead the science ministry. 
“Unfortunately, the science ministry is usually 
among the first bargaining chips in every new 
government, says José Eduardo Krieger, dean 
of research at the University of Sao Paulo. 

Brazil has had three science ministers in 
the past 16 months — and each was seem- 
ingly appointed for political ends rather than 
for any particular expertise. In January last 
year, Rousseff picked Aldo Rebelo, an avowed 
climate-change sceptic in the Communist 
party (the closest ally of Rousseff’s centre-left 
Worker's Party). He was followed by Celso 
Pansera, whose nomination was regarded as 
an attempt to lure votes from Temer’s Bra- 
zilian Democratic Movement Party against 
Rousseff’s impeachment. And Emilia Ribeiro, 
former vice-minister of the MCTI, has been 
science minister since April 2016. 


Interim President Michel Temer has decided that Brazil will no longer have a separate science ministry. 


Pansera says that such high turnover is 
detrimental to research. “It gives you a lot of 
discontinuity,’ he says. 


FUNDING SLASHED 

The most hurtful discontinuity for Brazil’s 
researchers has been startling cuts to federal 
funding. Last year, as the nation’s fiscal crisis 
began to bite, the MCTT’s budget was cut by 
some 1.9 billion reais (US$540 million), to 
5.4 billion reais. Its budget for this year was set 
37% lower than last year’s — and its author- 
ized spending has been chopped by another 
6%. The collapse of oil prices and a bribery 
scandal that involves Petrobras, the behemoth 
state-run oil company, have further reduced 
cash flows for Brazilian research, which partly 
depends on oil revenue. 

A major victim of the economic downturn 
is the flagship exchange programme Science 
Without Borders, which by the end of 2015 
had sent nearly 94,000 Brazilian undergradu- 
ate and postgraduate students to institutions 
abroad. The programme intended to send a 
further 100,000 students abroad by 2018, but 
its second phase has been called off. 

To revive science spending, the govern- 
ment had authorized the MCTI to negotiate a 
US$1.4-billion loan from the Inter-American 
Development Bank in Washington DC. But 
that operation has been reset, Pansera says. 
“Now we'll have to renegotiate everything with 
the new government's economic team.” 

Research institutes are struggling to survive. 
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At the Brain Institute at the Federal University 
of Rio Grande do Norte in Natal, researchers 
split basic maintenance expenses such as Inter- 
net bills among themselves and buy equipment 
with their own money, says institute director 
Sidarta Ribeiro. 

And some scientists are already planning to 
leave the country. Suzana Herculano-Houzel, 
a neuroscientist at the Federal University 
of Rio de Janeiro, launched a crowdfund- 
ing campaign last year to buy new parts for a 
microscope. But she has now decided to move 
to Vanderbilt University in the United States: 
this September, she will shut down her lab in 
Brazil, which had been running for a decade. 

Brazilian scientists are familiar with 
economic crises. In the 1990s, budget con- 
straints were the rule, recalls Paulo Artaxo, a 
physicist at the University of Sao Paulo. But 
before the current crisis, the country’s research 
had been experiencing unprecedented expan- 
sion, with funding on the rise and the number 
of PhD students soaring. In 2011, Brazil rose to 
13th in the world in research-paper production, 
up from 17th in 2001. That trend is reversing, 
says Rogério Meneghini, a specialist in science 
metrics and scientific director of SciELO, a 
collection of mainly Latin American journals. 
Brazil's research-paper production grew 16% 
between 2011 and 2014 — but his figures sug- 
gest that it dropped by 4% in 2015. “What is 
cruel about this is that, when you cut off on 
science, you can never resume from where you 
stopped’, Krieger says. “You lose position.” m 
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The RRS James Clark Ross is one of Britain’s two ageing polar ships, which are being replaced. 


Fleet of polar ships 
raises science stakes 


RRS Sir David Attenborough — named Boaty McBoatface 
by the Internet — is part of a wave of hardy ice- going vessels. 


BY ALEXANDRA WITZE 


r MVhe United Kingdom's new polar research 

vessel has become a national obsession. 

A proposal to christen the ship Boaty 
McBoatface captured the public imagination 
— so much so that the decision to name the 
ship instead after broadcaster and naturalist 
David Attenborough triggered questions in a 
parliamentary inquiry. 

But the RRS Sir David Attenborough, which 
will probe both the Arctic and Antarctic, is 
also notable because it will carry more scien- 
tists and push deeper into polar ice than any 
UK research vessel ever has. It joins a wave 
of ice-strengthened research vessels — from 
Norway to China to Australia — that promise 
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to expand scientists’ ability to explore harsh 
polar environments (see ‘Ships of the future’). 

China is soliciting bids to build a large 
research icebreaker to accompany its exist- 
ing ship Xuelong, and the Australian gov- 
ernment signed a contract last month for a 
replacement for its ageing Aurora Australis. 
Germany has started the process of replacing 
its vessel Polarstern, and Sweden is beginning 
to discuss what to do after its ship Oden is 
retired towards the end of the next decade. (A 
proposed pan-European icebreaker, costing 
up to €800 million (US$900 million), is on 
hold owing to its enormous price tag.) 

The 129-metre-long Attenborough is 
designed by the team behind the Kronprins 
Haakon, an icebreaker being built for the 
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Norwegian Polar Institute in Tromsg. When 
Kronprins Haakon sets off on its first science 
cruise in 2018, it will give Norwegian research- 
ers a serious upgrade in their access to Arctic 
seas. It will replace a smaller vessel that can 
carry fewer scientists and operate only in light 
ice. “It's a whole new world for us,’ says project 
manager Wystein Mikelborg of the Norwegian 
Polar Institute. 


OVER ICE 

Within weeks, architects will finalize the 
Attenborough’s design and workers in Mersey- 
side, UK, will begin cutting steel for its hull, on 
track for a 2019 delivery. 

Designers are cramming as many research 
goodies as possible into the £200-million 
(US$290-million) ship. The Attenborough 
will have a helicopter deck and, unlike Britain's 
existing polar-research vessels, a hangar. This 
will allow scientists to fly to otherwise inac- 
cessible lakes or islands in Antarctica. And 
the Attenborough will have a hole in the hull 
known as a moon pool, allowing researchers 
to deploy oceanographic and geological equip- 
ment more smoothly and safely than by swing- 
ing it off the side of the ship. 

Laboratory spaces will be kept at different 
temperatures, allowing storage and experi- 
ments of different types. The deck will have 
extra room for custom-made equipment for 
different cruises. Modern fibre-optic cables 
will deliver a live camera feed from as deep as 
6,000 metres, from a remotely operated vehi- 
cle that will be called Boaty McBoatface, ina 
concession to the public vote. 

Yet this increased research capability comes 
at a price: fewer days of actual science. 

The British Antarctic Survey (BAS) 
currently operates two polar vessels, the 
25-year-old RRS James Clark Ross for science, 
and the 21-year-old RRS Ernest Shackleton, 
which delivers equipment, food and people 
to its bases. To cut annual operating costs by 
20%, the bigger Attenborough will replace 
both ships. 

Instead of 180 days of science a year, UK 
scientists will get 150. The rest of the time (and 
space on the ship) will go to ferrying cargo. “I 
don't like to use the word compromise, but 
there is a trade-off,’ says Andrew Jeffries, 
project manager for the new boat and a BAS 
engineer in Cambridge, UK. 

There have been other trade-offs between 
research and logistics. Ship designers have 
had to place the vessel's enormous diesel gen- 
erators on isolating platforms to make the 
ship run silently, to avoid disturbing seismic 
and biological acoustics studies. The current 
design may also have to be tweaked to avoid 
vents from the helicopter fuel storage opening 
directly into areas where atmospheric scien- 
tists were counting on having clean air for their 
measurements. 

Even with these trade-offs, researchers are 
eager for the Attenborough to launch. Not only 
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SHIPS OF THE FUTURE 


As climate change alters the polar seas, nations 


are upgrading their ability to conduct research 
in these ice-strewn waters. 
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will it be able to push through thicker ice and 
carry more scientists, but the ship will also be 
able to explore deeper environments than any 
other UK research vessel, says Katrin Linse, a 
deep-sea biologist at the BAS. That ability will 
allow it to carry out the first direct sampling 
of the 8.5-kilometre-deep South Sandwich 
Trench in the southern Atlantic Ocean. “We 
might find a new ecosystem there that we are 
not aware of yet,’ she says. “It’s pretty exciting 
to get a new vessel. That only happens once 
every 30-40 years.” m 


SYNTHETIC BIOLOGY 


Enzyme boost for 


mirror-image life 


Polymerase that can copy left-handed DNA marks 
step forward for looking-glass biochemistry. 


BY MARK PEPLOW 


it. Researchers at Tsinghua University 

in Beijing have created a mirror-image 
version of a protein that performs two of the 
most fundamental processes of life: copying 
DNA and transcribing it into RNA. 

The work is a “small step” along the way 
to making mirror-image life forms, says 
molecular biologist Jack Szostak of Harvard 
Medical School in Boston, Massachusetts. 
“It’s a terrific milestone,’ adds his Harvard 
colleague George Church, who hopes one 
day to create an entire mirror-image cell. 

Many organic molecules are ‘chiral’: they 
can exist in mirror-image forms that cannot 
be superimposed, like a right-handed and 
left-handed glove. But life almost always 
employs one version: cells use left-handed 
amino acids, and have DNA that twists like a 
right-handed screw, for instance. In princi- 
ple, looking-glass versions of these molecules 
should work together in the same way as nor- 
mal ones — but they might be resistant to 
attack by viruses or enzymes that have not 
evolved in a looking-glass world. 

That makes mirror-image biochemistry a 
potentially lucrative business. One company 
that hopes so is Noxxon Pharma in Berlin. 
It uses laborious chemical synthesis to make 
mirror-image forms of short strands of DNA 
or RNA called aptamers, which bind to thera- 
peutic targets such as proteins in the body 
to block their activity. The firm has several 
mirror-aptamer candidates in human trials 
for diseases including cancer; the idea is that 
their efficacy might be improved because 
they aren't degraded by the body’s enzymes. 
A process to replicate mirror-image DNA 
could offer a much easier route to making 
the aptamers, says Sven Klussmann, Noxxon 
Pharmas chief scientific officer. 


E biochemistry — but not as we know 


THROUGH THE LOOKING-GLASS 

Researchers have been making chunks of 
mirror-DNA for decades, so the Tsinghua 
team could order much of what it needed for 
its looking-glass DNA replication attempt 
from a chemical supplier — a mirror-DNA 
strand to be copied, mirror-DNA building 
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blocks and a shorter mirror ‘primer’ strand 
that could pick up these building blocks in 
the right order. 

The difficult task was to make the mirror- 
image enzyme that coordinates the copying 
process, called DNA polymerase. That would 
need to be synthesized from right-handed 
amino acids, but commonly used polymer- 
ase enzymes have more than 600 amino acids 
— meaning that they are too big for current 
synthetic methods. 

So the Tsinghua team turned to the smallest 
known polymerase: African swine fever 
virus polymerase X, which contains just 
174 amino acids. Unfortunately, it is also 
spectacularly slow — probably because of its 
small size, says synthetic biologist Ting Zhu, 
a former graduate student of Szostak’s who 
helped to lead the work. The team made a 
mirror version of the enzyme and found that, 
like its natural equivalent, it could extend a 
mirror-primer consisting of 12 nucleotides 
(DNA building blocks) to an 18-nucleotide 
mirror-DNA strand in about 4 hours; and to 
a 56-nucleotide strand in 36 hours. 

When the normal and mirror-image ver- 

sions of these systems 


“For a while were mixed together 
mirror-image __ inthe same test tube, 
biochemistry both replication 
was anon- processes worked 
field. But now independently with- 
it seems very out interference. 
vibrant.” The mirror-image 


polymerase could 
also transcribe mirror-DNA into mirror- 
RNA — arelatively rare feat for a polymerase 
— again at a glacial pace. The work is pub- 
lished in Nature Chemistry (Z. Wang et al. 
Nature Chem. http://dx.doi.org/10.1038/ 
nchem.2517; 2016). 

Klussmann says that Noxxon Pharma is 
interested in pursuing a similar approach with 
amore efficient enzyme. Indeed, Zhu and his 
colleagues next hope to build a mirror-image 
of a more efficient polymerase known as 
Dpo4, which is built of 352 amino acids. 

In their research paper, the Tsinghua 
researchers also present their work as an 
effort to investigate why life's chirality is the 
way it is. This remains mysterious: it may > 
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> simply be down to chance, or it could have 
been triggered by a fundamental asymmetry 
in nature. But Steven Benner, at the Foun- 
dation for Applied Molecular Evolution in 
Alachua, Florida, says it’s unlikely that creat- 
ing a mirror form of biochemical life could 
shed light on this question. Almost every 
physical process behaves identically when 
viewed in a mirror. The only known excep- 
tions, called ‘parity violations, lie in the realm 
of subatomic physics. Such tiny differences 


would not show up in these biochemical 
experiments, says Benner. (He is also inter- 
ested in making DNA that can avoid unwanted 
degradation by natural enzymes or viruses, 
but rather than using mirror-DNA, he has 
created artificial DNA with non-natural build- 
ing blocks.) 

Church's ultimate goal, to make a mirror- 
image cell, faces enormous challenges. In 
nature, RNA is translated into proteins by 
the ribosome, a complex molecular machine. 


“Reconstructing a mirror-image of the 
ribosome would be a daunting task,” says Zhu. 
Instead, Church is trying to mutate a normal 
ribosome so that it can handle mirror-RNA. 

Church says that it is anyone’s guess as to 
which approach might pay off. But he notes 
that a growing number of researchers are 
working on looking-glass versions of bio- 
chemical processes. “For a while it was a 
non-field,’ says Church. “But now it seems 
very Vibrant.” = 


BIOTECHNOLOGY 


Bankruptcy of nanomedicine 
firm worries drug developers 


Financial troubles of leading biotech firm highlight challenges of making innovative drugs. 


BY HEIDI LEDFORD 


ot long ago, investors flocked to 
| \ | a firm in Massachusetts that 

was hailed as the leader in 
a wave of next-generation nano- 
technology companies develop- 
ing ways to ferry cancer drugs 
to tumours. But on 2 May, the 
company — BIND Therapeu- 
tics — declared bankruptcy. 

Researchers in the field of 
nanomedicine are waiting 
anxiously to see whether the 
Cambridge-based firm will 
pull through its financial crisis 
— and whether its troubles will 
taint the swiftly evolving field 
of nanoparticle drug delivery. 
“Tt’s been a rapid rise and fall,” says 
Eric Schmidt, a biotechnology ana- 
lyst at the investment bank Cowen 
and Company in New York City. “Tt’s 
all unravelled pretty quickly.” 

Because nanoparticles lessen the amount 
of contact that cancer drugs have with healthy 
tissue, they offer a chance to deliver higher 
doses with fewer side effects. In 1995, the US 
Food and Drug Administration approved the 
first such treatment, Doxil, which packages a 
chemotherapy drug called doxorubicin in a 
lipid nanoparticle. The particles are too large 
to escape from normal blood vessels — and so 
are less toxic to the heart than naked doxoru- 
bicin — but they can seep out of the leaky blood 
vessels often found in tumours. 

BIND’s nanoparticles were designed to tar- 
get tumours more precisely than liposome 
particles can. The company’s lead product, 
BIND-014, involves a polymer particle coated 
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BIND Therapeutics’ nanoparticle is coated in 
molecules that target it to tumours. 


with a molecule that steers the particle to a 
protein found in many tumours. The particle 
releases the chemotherapy drug it carries, called 
docetaxel, inside the tumour. 

Early tests in animals and small clinical 
trials showed that the approach was safer 
than docetaxel alone — and fuelled BIND’s 
US$70.5-million initial public offering in 2013. 
But later clinical trials disappointed. BIND-014 
failed against cervical and head-and-neck can- 
cers. Although it was somewhat effective against 
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one type of lung cancer, it was not clear whether 
the drug worked any better than regular doc- 
etaxel, says BIND’s chief scientific officer 

Jonathan Yingling. 

In April, the company announced 
that it would cut back on its work 
with BIND-014, and Yingling says 
that the firm will now explore 
new targets. It cut the number 
of employees by 38% and aims 
to trim its expenses to $6 mil- 
lion per quarter — a dramatic 
decrease for a company that 
spent $11 million on research 
and development alone in the 
first quarter of 2016. 
After one of its creditors 

demanded that BIND repay a 

loan ahead of schedule, the com- 
pany filed for bankruptcy (see 
‘Troubled times’). It plans to dis- 
pute the need for early repayment at 

a legal hearing on 18 May. “BIND is and 
will remain open for business,’ Andrew 

Hirsch, president of the company, told inves- 
tors on 9 May. 

Schmidt says that BIND remains at the 
technological forefront of nanoparticle drug 
delivery, but waited too long to move away 
from BIND-014. By then, the investor enthusi- 
asm for biotechnology that had driven BIND’s 
initial public offering had cooled. “People are 
not interested in funding technology right now,’ 
Schmidt says. “They're interested in funding 
later-stage projects. And the one at this com- 
pany didn’t have what it takes.’ 

In the time since BIND-014 was developed, 
researchers have also realized that differences 
between tumours — such as size, density and 
leakiness of the blood vessels that lace through 
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them — can affect how well nanoparticles 
penetrate them, says Warren Chan, a biomed- 
ical engineer at the University of Toronto in 
Canada. “You should eventually be able to per- 
sonalize the nanoparticles to the need,’ he says. 
“Tt’s just that we're not even close to there yet.” 


PERSONAL APPROACH 

Omid Farokhzad, who studies nanomedicine at 
Brigham and Womens Hospital in Boston, Mas- 
sachusetts, and is a co-founder of BIND, thinks 
that companies should determine whether a 
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nanoparticle would penetrate a patient's tumour 
before administering the therapy. Farokhzad 
points to Merrimack Pharmaceuticals, also in 
Cambridge, which used an imaging agent called 
ferumoxytol to assess the effects of its lipo- 
some-encased chemotherapy in a clinical trial 
last year. Early results suggested that tumours 
that took up ferumoxytol were more likely to 
respond to the nanoparticle-encased drug. 
Meanwhile, a third generation of nanoparti- 
cles is in the works: particles that shuttle larger 
molecules such as RNAs, says Pieter Cullis, 


INFOCUS Xai 


who studies nanomedicine at the University 
of British Columbia in Vancouver, Canada. 
Cullis has been working with Alnylam, a bio- 
technology firm also in Cambridge, to develop 
nanoparticles that deliver therapeutic RNA 
molecules to the liver. The excitement over 
gene-editing techniques such as CRISPR-Cas9, 
which could one day be used to correct disease- 
causing genes, is also fuelling interest in using 
nanoparticles to shuttle in the RNAs and pro- 
teins needed for the method, he says. 

But some are worried that BIND’s struggles 
could frighten investors away from the field. 
Mark Davis, a chemical engineer at the Cali- 
fornia Institute of Technology in Pasadena, is 
quick to note that other nanoparticles are show- 
ing promise. Last month, Celator Pharmaceu- 
ticals of Ewing, New Jersey, announced that its 
liposome-wrapped chemotherapies are effec- 
tive against a form of leukaemia. And Ceru- 
lean Pharma of Waltham, Massachusetts — a 
company that Davis co-founded — is making a 
polymer-based, targeted nanoparticle that has 
shown promising results in early trials when 
combined with other chemotherapies. 

“T personally would be shocked if 20 years 
from now, were not seeing a lot of nanotech- 
nologies helping a lot of people,’ says Robert 
Langer, a bioengineer at the Massachusetts 
Institute of Technology and a co-founder of 
BIND. “It will happen.” = 
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Stronger, tougher, stretchier:\with a simple new 
recipe, metallurgists are creating a generation of 


alloys 


ith remarkable'properties: 


BY XIAOZHI LIM | 


t first glance, the machine seems to be building a miniature 
cityscape. A ring of nozzles fires four jets of powdered metal 
into a downward-pointed laser beam, which fuses the colliding 
grains in a bright orange glow. The mixed grains then solidify on 
the growing tip of a small pillar of metal alloy. Once the pillar is 
1-2 centimetres high, the platform that holds it shifts to the side, and the 
machine starts to build another one right next door. The result looks like 
a forest of toy skyscrapers. 

In reality, these towers, generated at the Ames Laboratory in Iowa, 
reflect a major shift in how researchers think about alloys. The standard 
recipe — used for technologies ranging from ancient swords and arrow- 
heads to modern jet-engine turbines — is to take a useful metal and mix 
in a pinch of this or a touch of that to improve its properties. One classic 
example is the addition of carbon to iron to make steel. 

But the machine at Ames is making experimental samples of ‘high- 
entropy alloys, which consist of four, five or more elements mixed 
together in roughly equal ratios. This deceptively simple recipe can yield 
alloys that are lighter and stronger than their conventional counterparts, 
while being much more resistant to corrosion, radiation or severe wear. 
Eventually, researchers hope, this approach could even produce alloys 
that have magnetic or electrical properties never seen before, leading 
to whole new generations of technology. 

“We have almost explored everything for traditional alloys,’ says Yong 
Zhang, a materials scientist with the State Key Laboratory for Advanced 
Metals and Materials at the University of Science and Technology 
Beijing. “For high-entropy alloys, the science is very new, he says — so 
new that no such alloy has yet made the leap from lab to market. But some 
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researchers are working to make that hap- 
pen, eyeing potential applications that range 
from high-temperature furnace linings to 
ultralightweight aerospace materials. And 
the field has attracted funding from research agencies in China, Europe, 
the United States and elsewhere. 

“Were not talking about a narrow class of materials, but an extremely 
broad philosophy on how to combine elements,” says Daniel Miracle, a 
materials scientist at the Air Force Research Laboratory at the Wright- 
Patterson Air Force Base in Ohio. “The opportunity to find something 
new and exciting is very high.” Last year, he and his colleagues esti- 
mated’ that almost 313,560 different alloys can be made by combining 
exactly equal proportions of 3, 4, 5 or 6 metallic elements from a set of 
just 26. More possibilities can come from varying the proportions or 
expanding the choice of elements. 

But not every combination is a winner, says Easo George, a materials 
engineer at Ruhr University Bochum in Germany. Scientists are still 
learning what works and what doesn't. Still, he says, “the space avail- 
able for exploration is really huge, and we have only looked at a small 
portion of the Universe”. 

The idea for high-entropy alloys first struck metallurgist Jien-Wei 
Yeh in 1995, while he was driving through the Taiwanese countryside. 
The physics of conventional alloys was already well understood, says 
Yeh, who works at National Tsing Hua University in Hsinchu, Taiwan. 
At the atomic level, pure metals have a regular crystal structure that 
stacks layer upon layer of identical atoms. Often, these layers slip past 
each other easily, which makes the metal too soft to be useful. That is 


High-entropy alloys could 
be used in hotter-running 
jet-engine blades. 
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SOURCE: ADAPTED FROM FIG. 4 IN GLUDOVATZ, B. 
ET AL. NATURE COMMUN. 7, 10602 (2016). 


why pure gold is rarely used in jewellery: it cannot survive much wear 
and tear. But ifa metalsmith mixes in an element with a different atomic 
size, the interloper will randomly disrupt the layers and reduce their 
tendency to slip, which creates a much harder alloy. The correct choice 
of compounds can enable metallurgists to tailor other properties as well, 
such as corrosion resistance or melting point. 

But Yeh was also well aware of the potential complications. If too 
much of the alloying element were 
added, for example, its atoms could 
stop falling randomly among the 
layers, and instead might start to alter- 
nate with the primary metal atoms in 
a more regular pattern, producing a 
compound that is weak and brittle. 

And that gave him an idea, says Yeh: 
instead of starting with one primary 
material and mixing in small quanti- 
ties of one or two elements, why not 
stir together similar quantities of four 
or five elements — or even more? The 
number of possible ways for the dif- 
ferent atoms to arrange themselves 
would expand dramatically, leading 
to a tendency towards disorder, or 
‘high entropy, that would overwhelm 
any bias favouring a regular crystal- 
lattice structure. Because each of the 
randomly mixed elements would be a 
different size, the atoms would become 
lodged into place and less able to slide 
past each other, creating a very hard 
material (see “Tough and strong’). 

It was a strange idea at the time, and Yeh gave it a low priority even 
in his own laboratory; it wasn't until 2004 that his research group first 
reported success at mixing five to ten elements together’, producing 
alloys that were considerably harder than stainless steel. Another group 
independently announced similar findings around the same time’. 
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FRACTURE TOUGHNESS > 


SPOILT FOR CHOICE 

The field began to move rapidly after that. In 2009, Zhang described* 
an alloy composed of cobalt, chromium, copper, iron, nickel and alu- 
minium that was more than 14 times stronger than pure aluminium, but 
had nearly 3 times the ductility — a measure of a metal’s ability to stretch 
without breaking. In 2011, Yeh devised’ a cobalt, chromium, iron, nickel, 
aluminium and titanium alloy that is twice as resistant to damage from 
friction as conventional wear-resistant steels. And in 2014, George and his 
team concocted’ a cobalt, chromium, iron, manganese and nickel alloy 
that can be cooled below liquid-nitrogen temperatures without becom- 
ing brittle. The material could be useful for cryogenic vessels, natural-gas 
pipelines and other low-temperature applications, such as spacecraft. 

Often, however, the best thing about high-entropy alloys — the 
massive number of possibilities — can also be researchers’ biggest chal- 
lenge. With more than 80 metallic elements in the periodic table, says 
Miracle, “there are way too many alloys to test, and not enough time’. 
For his own work on high-entropy alloys for aircraft engines and aero- 
plane frames, he is searching for materials that are lighter, more resistant 
to corrosion and better able to maintain their strength at high tempera- 
tures than anything currently available. To cope with the abundance of 
choices, Miracle is focusing on elements such as niobium, tantalum and 
chromium, which have high melting points to begin with. 

Another strategy is to try to replicate the characteristics of alloys that are 
known to work well. For example, some steels are not just a random mix of 
atoms, but instead contain small nodules of compounds that form when 
the steels cool rapidly. Although such a composite structure is less stable 
than a random mixture, it confers high ductility to the steels. Cem Tasan, 
a metallurgist at the Massachusetts Institute of Technology in Cambridge, 


TOUGH AND STRONG 


By mixing multiple metals in roughly equal proportions, 
high-entropy alloys can offer unique combinations of yield 
strength and fracture toughness. 


Resistance to deformation (log scale) 
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has used that knowledge to blend iron, manganese, cobalt and chromium 
into a high-entropy alloy’ that is simultaneously extremely hard and 
highly ductile — traits that once seemed completely incompatible. “Tt 
doesn't make sense to abandon all that we know,’ he says. 

The mini-skyscrapers at Ames represent another, more systematic 
approach. The machine can build up to 30 pillars in less than an hour 
with a slightly different mix of raw materials in each, so that researchers 
can test the properties of many alloys 
quickly. Matthew Kramer, a materials 
scientist at Ames, leads a project to find 
high-entropy alloys that can withstand 
high temperatures and resist corro- 
sion, which could help power plants 
to operate at higher temperatures and 
become more efficient. 

Aiding his team is Duane Johnson, 
a theorist at Ames who, in 1995, devel- 
oped an algorithm to predict the prop- 
erties of conventional alloys before 
they are made’. In 2015, he expanded 
the code to work for high-entropy 
alloys’. Johnson’s algorithm assesses 
how much one element is attracted to 
or repelled by another, and then using 
that information to predict whether a 
mixture of elements will form a com- 
pound, a solid solution or a mixture of 
both. That enables Kramer's team to 
identify which alloys might be worth 
investigating. The experimental results 
are then fed back into the algorithm to 
validate and improve the code. 

There are a number of obstacles to overcome to move the high- 
entropy-alloy field forward. So far, the emphasis has been on improv- 
ing structural properties, such as strength. But there has been much less 
work on developing alloys with specific ‘functional’ properties, includ- 
ing conductivity or response to a magnetic field — a development that 
would enable applications in areas such as refrigeration and electronics. 

Nonetheless, there are many possibilities still to be explored, especially 
as researchers start to extend the concept well beyond its original defini- 
tion. Zhang, for example, is mixing metals with elements such as carbon, 
nitrogen and silicon in an effort to develop new high-temperature ceram- 
ics for use in solar-power applications. 

And some, including Tasan and Yeh, have begun to experiment 
with alloys that contain elements mixed in high, but unequal propor- 
tions. Their preliminary findings show that many of these still have all 
the properties that make high-entropy alloys desirable in the first place. 
For example, Yeh has prepared a range of hard materials that consist of 
50% nitrogen, carbon or oxygen, along with a mix of other elements, such 
as aluminium, silicon or titanium, that can withstand scratching. They 
could be used as long-lasting coatings for machine parts and cutting tools. 
“We now have this rich, rich field for exploration,’ says George. m 
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XiaoZhi Lim is a freelance writer based in Singapore. 
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Old-fashioned breeding techniques are bearing more 
fruit than genetic engineering in developing 


onathan Lynch likes to look beneath the 
surface. In his quest to breed better crops, 
the plant physiologist spends a lot of time 
digging up roots to work out what makes 
some varieties extremely good at extracting 
nutrients from the ground. Lynch wants to use 
this knowledge to develop plants with extra- 
efficient roots — crops that grow well in the 
nutrient-starved soils of the developing world. 
These plants could also reduce the use of fertiliz- 
ers in richer nations. 
Last year, Lynch’s forays into the dirt paid off. 
He and his team at Pennsylvania State Univer- 
sity in University Park reported’ that they had 


308 | NATURE | VOL 533 | 19 MAY 2016 


self-sufficient super plants. 


BY NATASHA GILBERT 


produced a variety of common bean, or string 
bean (Phaseolus vulgaris), with a combination 
of root traits that allows it to take up phospho- 
rus from the soil with improved efficiency. In 
experimental plots, the plants produced three 
times the bean yield of typical varieties. 

That result has raised hopes in Africa, where 
common beans are one of the most important 
sources of protein for poor people. Researchers 
in Mozambique are testing how Lynch’s beans 
perform in the country’s ecological zones, and 
they expect to win regulatory approval to bring 
the crop to market by next year. 

Lynch’ beans are among the first successful 
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attempts in a global race to develop crops that 
grow well in soils depleted of nutrients. “Low 
availability of nitrogen, phosphorus and water 
are the main limitations of plant growth on 
Earth. We desperately need this technology,’ 
says Lynch. 

His work stands out because he has taken an 
old-school approach. He is leading a renais- 
sance in some conventional crop-breeding 
techniques that rely on laboriously examin- 
ing plants’ physical characteristics and then 
selecting for desirable traits, such as growth 
or the length of fine roots. 

And surprisingly, this approach seems to 
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Farmers in the 
developing world 
would benefit from 
nutrient-efficient crops. 


be outpacing the 
high-tech route. 

Big corporations 
such as DuPont 
Pioneer in Johnston, 
Iowa, have spent more than a decade developing 
improved crops through genetic engineering, 
and some companies say that their transgenic 
varieties look promising in field trials. But there 
are still no fertilizer-frugal transgenic crops on 
the market, and several agricultural organiza- 
tions around the globe are reviewing their bio- 
technology initiatives in this area. 

Plant biologist Allen Good of the University 
of Alberta in Edmonton, Canada, spent years 
working with companies to develop geneti- 
cally modified (GM) crops that require little 
fertilizer, but he says that this approach has not 
been as fruitful as conventional techniques. 
The problem is that there are so many genes 
involved in nutrient uptake and use — and 
environmental variations alter how they are 
expressed. 

“Nutrient efficiency was supposed to be 
one of those traits with broad applicability 
that could make companies lots of money. But 
they haven't developed the way we thought,” 
says Good. 

Despite the scientific and breeding chal- 
lenges, some researchers say that all strategies 
must be explored to develop crops that are less 
nutrient needy. With the global population 
heading towards 10 billion people by 2050, 
frugal crops could be essential to feed the 
planet. “There is a huge worldwide potential 
for these traits to help increase food produc- 
tion and sustainable development,’ says Matin 
Qaim, an agricultural economist at the Univer- 
sity of G6ttingen in Germany. 


MAGIC BEANS 

Lynch says that he spent years “groping around 
in the dark” before he worked out which root 
characteristics help P vulgaris to grow in low- 
phosphorus soils. Screening for root traits is a 
messy and difficult business, so it is unpopular 
among researchers, he says. That's why most 
previous research had focused on crop shoots 
rather than their roots. His team dug up plants 
to decipher which characteristics were benefi- 
cial, taking great care to avoid damaging the 
fragile root structures. The researchers didn't 
even know what traits they were looking for in 
the tangled masses, which could contain thou- 
sands of roots per plant. 

The key to success is having an open mind 
and just seeing what the root looks like, says 
Lynch. “I tell my students they should drop 
acid before they go to the field, and just look 
at the plants and let them tell you what they 
are doing” 

One feature that interested Lynch and his 
team was root depth, because they knew that 
phosphorus tends to stick to the upper layers 
of soil. The researchers reported’ last year that 
beans with basal roots — those close to the 


base of a plant — that grow at a shallow angle 
produced 58% more above-ground biomass 
than beans with steeper basal roots. The team 
also discovered that plants with long root hairs 
took up phosphorus better and improved the 
bean yield by 89%, relative to the variety with 
shorter root hairs. When the researchers knit- 
ted both the root traits together, they saw much 
bigger gains. In field trials in Mozambique, 
beans with shallow roots and long root hairs 
gave yields of 1,500 kilograms per hectare in 
soils depleted in phosphorus, compared with 
local varieties that give just 500 kilograms per 
hectare. 

The beans could bea boon for African farm- 
ers who struggle with depleted soils. Accord- 
ing to the Food and Agriculture Organization 
of the United Nations, soils in sub-Saharan 
Africa lose around 26 kilograms of nitrogen 
and 3 kilograms of phosphorus per hectare 
each year through farming and erosion. 
Because many farmers cannot afford to replen- 
ish these nutrients with fertilizers, this stunts 
yields to about one-tenth of those in developed 
countries. 

The next big research challenge is to 
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lateral roots rather than many short ones. This 
arrangement requires less energy to grow, so 
plants can invest more resources in above- 
ground vegetation and developing deep roots 
that can suck up water and nitrogen. 

Tests in US fields starved of nitrogen 
showed? that maize with few long lateral roots 
produced yields more than 30% higher than 
maize with many short lateral roots. The long- 
lateral-root variety performed even better in 
a test of drought-like conditions, where it 
showed a 144% increase in yield’. 

Other research teams are taking different 
approaches to breeding fertilizer-frugal crops. 
Over the past six years, a group at the Interna- 
tional Maize and Wheat Improvement Cen- 
tre (CIMMYT), headquartered in Texcoco, 
Mexico, has had success using conventional 
breeding to create maize that grows well in the 
nitrogen-starved soils of Africa. 

Field trials show that CIMMYT’s maize 
produces up to 20% higher yields than existing 
commercial maize varieties in low-nitrogen 
soils. By the end of 2015, the centre had won 
regulatory approval for 14 nitrogen-efficient 
maize varieties across 6 countries, including 


“THERE IS A HUGE WORLDWIDE POTENTIAL FOR THESE 
TRAITS TO HELP INCREASE FOOD PRODUCTION.” 


improve the beans’ tolerance to drought 
while maintaining their ability to thrive in 
low-phosphorus soils. This could be tricky 
because crops under water stress need deep 
roots — the opposite of what they need to 
absorb phosphorus, says Magalhaes Miguel, a 
plant physiologist at the Agricultural Research 
Institute of Mozambique in Chimoio who 
collaborates with Lynch. Miguel is testing the 
beans in different locations around Mozam- 
bique to win approval for seed companies to 
produce and sell the crops. 

Lynch’s phosphorus discoveries have 
already made the jump into farmers’ fields 
in China. His colleagues there have shown’ 
that some of the same root traits that boost 
common beans also improve growth in soya 
beans (Glycine max). The regulatory system 
in China is faster at approving new crop varie- 
ties than that in Mozambique, so the improved 
soya beans are already in use and have been 
planted on 1 million hectares so far. 

Lynch has also had some success in improv- 
ing crops uptake of nitrogen, which tends to 
accumulate deeper in the soil. The ability to 
grow deep roots is influenced by several fac- 
tors, including the characteristics of a plant's 
lateral roots — those that grow out of the main 
framework of roots. Working in maize, or corn 
(Zea mays), Lynch found that plants perform 
better under low-nitrogen conditions® and 
water stress* when they have just a few long 
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Malawi and Kenya. The organization has more 
than 50 other varieties in the works. Biswanath 
Das, a maize breeder who leads CIMMYT’s 
project in Nairobi, says that the team is still 
investigating why some varieties are better 
than others at finding and using nitrogen. 
“We hope to have some answers in the next 
two years, says Das. 


BOOM AND BUST 

As Lynch and CIMMYT make big leaps 
forward, they are overtaking agricultural- 
biotechnology companies that have invested 
years of work in tests with GM crops. 

Good, who was involved in some of the early 
experiments, remembers those heady times. In 
2001, he walked up and down rows of transgenic 
and conventional oilseed rape (Brassica napus, 
also known as canola) growing side by side in 
Californian test fields starved of nitrogen. He 
marvelled at the contrast between the bigger, 
greener GM plants, flourishing in the stress- 
ful conditions, and their unmodified relatives 
that were struggling to survive. The GM plants’ 
advantage was a transgene: alanine aminotrans- 
ferase (AlaAT), which came from barley. 

“I thought, ‘Oh my god, you can see the 
gene working in the field? says Good. A few 
years earlier, he had discovered that AlaAT 
— which has an important role in how plants 
metabolize nitrogen and incorporate the nutri- 
ent into grain — could help crops to thrive 
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in low-nitrogen soils. The gene is still one 
of the more advanced targets for transgenic 
approaches to farmers’ over-dependence on 
fertilizer. 

The 2001 trials were run by Arcadia 
Biosciences, an agricultural-biotechnology 
company in Davis, California, for which Good 
was working at the time as a consultant, and to 
which he had licensed the AlaAT discovery. In 
a collaboration with biotechnology company 
Monsanto in St Louis, Missouri, separate field 
studies” showed that the transgenic oilseed rape 
produced 42% more seeds under low-nitrogen 
conditions than did conventional controls. 
They also found that the GM crop needed 40% 
less nitrogen fertilizer to achieve yields equiva- 
lent to those of unmodified crops. 

Arcadia says that it has successfully put the 
gene into other crops, including wheat and 
rice. The company has one of the industry’s 
most robust nitrogen-efficiency programmes, 
says Roger Salameh, Arcadia’ chief operating 
officer. “We have significant evidence from mul- 
tiple years of field trials in the two largest food 
crops in the world — wheat and rice — that we 
are driving double-digit yield gains,” he says. 

Despite the positive results, Arcadia has not 
yet commercialized these or related crops. The 
company has worked on the nitrogen-efficient 
wheat with several other companies and 
governmental agencies, including the Com- 
monwealth Scientific and Industrial Research 
Organisation (CSIRO), Australia’s national 
science agency, but the CSIRO pulled out of 
that project. A spokeswoman for the organiza- 
tion says that it “did not see results that met its 
objectives so is no longer actively researching 
this area’. 

Arcadia was hoping to have a transgenic 
variety of nitrogen-efficient wheat on the mar- 
ket this year, but it will miss that target. Salameh 
says that it is uncertain when the plant will be 
ready, but the company is now working with 
Vilmorin and Cie, a seed company in Saint- 
Beauzire, France, to commercialize the wheat. 


FROM THEORY TO PRACTICE 
Qaim and some other agricultural analysts are 
impressed with the results of several transgenic 
efforts. For example, Arcadia and its research 
partners say that they have developed a trans- 
genic variety of rice that produced, on average, 
30% higher yields than conventional controls 
during four years of field trials in Colombia 
under low-nitrogen conditions. However, 
Qaim says that the crops’ performance has yet 
to be demonstrated in realistic field conditions. 

Good has heard that the GM oilseed rape is 
‘tweaky, meaning that it performs well in some 
environments but not others. He has lost his 
initial rapture with it, and says that if he had his 
time again, he would put the lion's share of his 
efforts into conventional breeding approaches 
like those of Lynch and CIMMYT. 

Good and the CSIRO are not the only ones 
worried that the transgenic crops are not yet 
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Specially bred maize (left) has roots that transport nutrients better than a conventional plant. 


delivering as hoped. The CIMMYT effort to 
improve maize yields in low-nitrogen soils 
kicked off in 2012 with a project to develop a 
transgenic variety as well as the conventional 
maize. The transgenic plants are based on tech- 
nology donated by DuPont Pioneer. They have 
produced some increases in yields, but not yet 
enough to warrant the huge effort required to 
gain regulatory approval, says Das. The trans- 
genic programme is now under review. “The 
yield benefits are just not that exciting for com- 
mercialization yet,’ he says. 

DuPont Pioneer has also seen some delays 
in its transgenic products targeted to US mar- 
kets. The company told Nature in 2010 that it 
would have a product available by 2022 (see 
Nature 466, 548-551; 2010). It now says that 
the products are still in the proof-of-concept 
stage, and will not be ready until the middle 
or late 2020s. 

David Spielman, an economist at the Inter- 
national Food Policy Research Institute in 
Washington DC, says that the delays don't 
necessarily signal disaster: companies fore- 
cast precise timelines to satisfy investors, but 
the science doesn't always deliver according to 
corporate plans. The pushed-back timelines 
do not mean that the scientific advances wont 
come at some point in the future, he says. 

“Tt is more important to consider if the traits 
are showing improvement and if public and 
private sectors are investing in the research 
with a longer-term view of the path to com- 
mercialization,’ says Spielman. 

A key challenge for transgenic efforts is that 
traits such as nutrient-use efficiency are influ- 
enced by a complex web of genes that interact 
with each other and the environment. “It's very 
difficult to take one transgene and stick it into a 
maize variety and expect a big effect,’ says Das. 

DuPont Pioneer and Arcadia are still con- 
fident that a single gene will one day deliver 
the nutrient-efficient goods. “People question 
if we could find a single gene with a big enough 
effect, but we are having real success,’ says Tom 
Greene, research director at DuPont Pioneer. 
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And Salameh also remains confident about 
this approach. 

Meanwhile, researchers are trying to com- 
bine old-fashioned breeding with a variety of 
high-tech methods. Lynch is now using a com- 
puter model that simulates natural selection 
to work out what combination of root charac- 
teristics will boost crops’ performance under 
nutrient stress. He is also identifying genetic 
markers for useful root traits, which should 
help to indicate whether plants’ offspring 
have the desired genes and should speed up 
the breeding processes, he says. 

Government regulation could perhaps be 
the biggest hurdle to both conventional and 
transgenic frugal crops. Regulatory hurdles 
in Europe and Africa keep many transgenic 
technologies from reaching large-scale field 
testing and commercialization, says Qaim. 
These hurdles bear part of the blame for the 
slower-than-expected progress in transgenic 
frugal crops, he says. 

And in some African countries, it can take 
up to ten years to get new conventional vari- 
eties to market because of stringent testing 
requirements, says Das. Farmers and breeders 
are also hesitant to take a risk on new varieties. 

But if CIMMYT can get the seeds into 
farmers’ fields, the impact on food security 
and the environment in developing countries 
could be massive, says Das. “The beauty of it 
is that the solution is packaged in the seed,” 
he says, so there is no need for extra fertilizer. 
“Once the farmers have the seed, they have the 
solution.” = 


Natasha Gilbert is a writer in 
Washington DC. 
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Global standards for 
stem-cell research 


New guidelines from the International Society for Stem Cell Research offer a model 
for self-regulation in contentious areas, write Jonathan Kimmelman and colleagues. 


promise for biomedicine. It also raises 

vexing ethical and policy challenges. 
It can involve the destruction, creation and 
modification of human embryos, and has 
led to the premature marketing and use of 
unproven therapies. 

On 12 May, in response to scientific pro- 
gress and evolving ethical concerns, the 
International Society for Stem Cell Research 
(ISSCR) issued updated and extended guide- 
lines’ for work involving the manipulation 
of stem cells and the translation of that work 
into medical therapies. The ISSCR is an 


Gre research offers tremendous 


independent non-profit organization that 
was established in 2002 to provide a forum 
for communication and education in the 
emerging field of stem-cell research and 
regenerative medicine. The society devel- 
oped guidelines for embryonic-stem-cell 
research’ in 2006 and for clinical translation 
of stem-cell research’ in 2008. We represent 
the working groups that drew up the new 
guidelines. 

The revised ISSCR guidelines provide a 
model of self-regulation for other potentially 
contentious research areas. Today’s science 
engages many different actors: researchers, 
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taxpayers, regulators, journals, sponsors, 
industries and, often, patients. Meanwhile, 
manuscripts, protocols, tissues and even 
patients routinely cross national bounda- 
ries. In this landscape, different stakehold- 
ers need to be confident that their interests 
will be protected when they collaborate with 
parties who might have differing views or 
goals. International guidelines are better 
positioned than national laws to help ensure 
protection. 

The new ISSCR guidelines span 27 pages. 
Here we highlight the most dynamic areas 
for policy, from the introduction of 
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» heritable changes into the human genome 
to the use of sham surgical procedures in the 
testing of cell-based interventions. 


RESEARCH CHALLENGES 

Human embryos. In the decade since the 
ISSCR’s previous guidelines were issued, 
embryo research has entered new arenas. 
Mitochondrial-replacement techniques 
(MRT) may soon be used to replace dysfunc- 
tional mitochondria in eggs or embryos with 
those obtained from healthy donors. In the 
United Kingdom, a pathway for bringing this 
approach to clinics was approved last year. 
And a committee convened by the National 
Academy of Medicine provided recommen- 
dations in February to the Food and Drug 
Administration that would enable clinical 
testing in the United States. 

Mitochondrial diseases result in debilitat- 
ing physical, developmental and cognitive 
disabilities. MRT could reduce the chances of 
women passing muta- 


tions associated with “A growing 
these diseases onto body of 

their children, butthe research 
processes also carry requires the 
risks that are poorly yee of fresh 
understood. human eggs.” 


More-contentious 
gene-editing techniques such as CRISPR- 
Cas9 now enable researchers to modify the 
nuclear DNA of human sperm and eggs (gam- 
etes) and embryos. As with MRT, there are 
uncertainties about the safety of these tech- 
niques. Both MRT and editing the nuclear 
DNA of human gametes or embryos would 
introduce potentially heritable alterations 
into the human genome. Societal consensus is 
lacking on whether making changes that can 
be inherited to the genomes of individuals is 
something that humankind should pursue. 

Because of this, the new ISSCR guidelines 
assert that any attempt to modify the nuclear 
genome of human embryos for the purpose 
of human reproduction be prohibited at this 
time. The revised guidelines do, however, 
endorse continued laboratory-based research 
on human embryos and the derivation of 
stem-cell lines from them. 

Just after the first ISSCR guidelines were 
issued in 2006, scientists reported the deriva- 
tion of induced pluripotent stem (iPS) cells. 
These are adult cells that are reprogrammed 
to an embryonic-like state. Although enor- 
mously valuable, iPS cells do not obviate 
the need for human embryonic stem cells in 
research’, In fact, a better understanding of the 
different states of pluripotency has renewed 
biologists’ interest in deriving embryonic stem 
cells with distinct properties. It has also led 
to a growing recognition that common ani- 
mal models inadequately recapitulate many 
aspects of human embryonic development’. 

For research involving human embryos, 
the revised guidelines assert the need for 
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a specialized embryo-research-oversight 
(EMRO) process. 

Institutions are best positioned to decide 
what specific mechanism to use to review 
embryo research. One option may be to 
repurpose existing embryonic-stem-cell 
research oversight (ESCRO) committees to 
take on a broader embryo-research-oversight 
function. Regardless of process, researchers 
and reviewers are urged to adhere to certain 
ethical principles. Among the ‘points to con- 
sider’ listed by the ISSCR are: whether donors 
of eggs or embryos have provided informed 
consent; the justification for the study; the 
number of embryos that will be used; and the 
quality of the study design. 

The principles embodied by the revised 
EMRO process should be applied to the 
development of MRT, and to investigations 
of embryo-like structures, which are being 
increasingly used to model various stages of 
human development in the lab®. Such experi- 
ments warrant rigorous EMRO review to 
eliminate prospects that structures with the 
potential for integrated human organismal 
development are kept in vitro for anything 
more than the minimal periods required to 
address compelling scientific questions. 


Human eggs. A growing body of research 
requires the use of fresh human eggs, whether 
in mitochondrial replacement, gene editing 
in vitro or nuclear transfer (a form of cloning 
for research). 

Egg donation is invasive and time-consum- 
ing: it involves hormone treatment and the 
retrieval of eggs by needle biopsy. There are 
also uncertainties about its long-term effects. 
The practice raises several issues, including 
how to compensate women for the risks and 
discomfort but avoid economic exploitation. 
The new guidelines propose standards. They 
recommend that when women are paid, the 
compensation is in line with that received by 
volunteers in other research’. 


ON THE UP 


In recent years, the number of clinical 
trials involving stem-cell therapies has 
increased worldwide. 
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Human-animal chimaeras. Various 
groups are implanting human tissues into 
the bodies or brains of pigs, non-human pri- 
mates or rodents. The resulting ‘chimaeric’ 
organisms are used to study human organ 
development and aspects of brain function, 
and to establish models of human cancer. 
Such transfers of human tissue raise ques- 
tions about animal welfare and the limits of 
permissible chimaera research: it could alter 
animal cognition or pain perception, or lead 
to the formation of human gametes in the 
target animal. 

The uncertainty over what is ethically 
defensible in this area was made appar- 
ent last year. The US National Institutes of 
Health suspended its funding of certain 
categories of animal-human chimaera 
research to first evaluate “the ethical issues 
that should be considered, and the relevant 
animal welfare concerns”. 

The ISSCR guidelines offer standards 
for researchers and reviewers that draw 
on welfare considerations that are broadly 
applicable to transgenic animals. They also 
recommend that certain categories of exper- 
iments be prohibited, such as the breeding 
of non-human animals that might harbour 
human gametes. 


Induced pluripotent stem cells. It is unclear 
in many national policies whether stud- 
ies that involve iPS cells should undergo 
a specialized stem-cell research oversight 
(SCRO) or an ESCRO review. The ISSCR 
guidelines recommend that iPS cell work 
be instead subject to institutional oversight 
of studies involving human participants, 
supplemented with stem-cell-relevant 
informed consent procedures. This would 
free up SCRO or ethical-review committees 
to focus on ethically sensitive research activi- 
ties involving embryos. 


CLINICAL CHALLENGES 

Irreproducible results and the incomplete 
reporting of findings from preclinical stud- 
ies are of particular concern for emerging 
interventions involving the transfer of liv- 
ing human cells. Decades of research have 
yielded some general insights about the 
behaviour of drugs in people. By contrast, 
the mechanisms underlying potential cel- 
lular therapies remain poorly understood 
for most tissues. And unlike drugs, which 
are metabolized and excreted from the body, 
stem cells and their progeny can persist, 
sometimes for life. 

The ISSCR's 2016 guidelines articulate a 
detailed set of expectations regarding the 
design, reporting and systematic review 
of preclinical evidence. For instance, they 
advocate that the results of all preclinical 
studies — positive, negative and inconclu- 
sive — be reported in peer-reviewed journals 
and that investigators conduct a systematic 
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Researchers store human embryos donated for research in liquid nitrogen. 


review to capture all relevant information 
before initiating a clinical trial (see ‘On the 
up’). They also stipulate that trials begin 
only after investigators have achieved a high 
standard of safety and efficacy in relevant 
preclinical research, as determined by an 
independent peer-review process. 

The guidelines contain strong statements 
about transparency in clinical research and 
endorse the prospective registration of all 
trials in public databases, regardless of phase. 
They also call for complete and accurate 
reporting of results in accordance with stand- 
ards such as those provided by the CONSORT 
statement — an evidence-based set of recom- 
mendations for reporting randomized trials 
(see www.consort-statement.org). 

Despite vigorous admonitions by scien- 
tists and regulators against the premature 
clinical translation of stem-cell therapies’, 
numerous providers continue to deliver ill- 
defined cell preparations to patients outside 
of trials. In 2014, for instance, leaked docu- 
ments from an investigation showed that a 
provider of an unproven cell-based interven- 
tion in Italy had no mechanism for screening 
cells for pathogens, and that sections of its 
protocol had been lifted from Wikipedia. 

Such practices are worrisome. The 
adverse effects on people's health that may 
result threaten to set back more painstaking 
and methodical research programmes by 
increasing anxiety in ordinary citizens and in 
regulators about stem-cell-based medicines. 
The revised guidelines reiterate the condem- 
nation of such practices, and consider the 
proper testing of new stem-cell interventions 
in the context of rigorous trials a matter of 
professional responsibility. 

Increasingly, trials are being funded by 
patient communities, or people are paying 
to participate. Patient-funding may facili- 
tate some trials that wouldn't otherwise have 
been pursued — such as those aimed at help- 
ing people with rare conditions. But it also 
allows projects to bypass the long-established 


mechanisms of peer review and independent 
oversight that encourage scientific rigour and 
safety. The ISSCR guidelines stipulate that 
studies involving paying patients are per- 
missible only when they are subject to inde- 
pendent review mechanisms that can assess 
scientific rationale, priority and design. 

The marketing of unproven interventions 
has happened in part because of exagger- 
ated claims about stem-cell treatments in 
the media. Accordingly, the revised guide- 
lines describe ways in which researchers can 
responsibly communicate with the public, 
emphasizing the need for balance, clarity 
and the avoidance of unrealistic optimism. 


EVER EVOLVING 

The ISSCR’s 2016 guidelines are not intended 
to be the only or last word. They were devel- 
oped by 25 scientists and ethicists from 
Asia, Europe, North America and Australia, 
with review and feedback from more than 
100 individuals and organizations, including 
regulators, funding bodies, journal editors, 
patient advocates, researchers and members 
of the public. Some will consider aspects of 
the guidelines too permissive; others will find 
parts too restrictive. Moreover, some coun- 
tries have well-articulated policies that may 
supersede them. The UK Human Fertilisation 
and Embryology Authority, for instance, has 
well-developed guidance on the use of MRT. 

Even as we were finalizing the latest guide- 
lines, new ethical challenges were emerging 
— such as questions about biosafety; whether 
experimentally generated, self-organizing 
embryonic tissues should be treated in the 
same way as human embryos; and whether to 
revisit the “14-day rule’ that limits the cultur- 
ing of human-embryos in vitro to two weeks”. 
The guidelines are intended to be revisited as 
science and social priorities evolve. 

The many limitations of such an 
aspirational document notwithstanding, we 
believe that the ISSCR guidelines are well 
positioned to secure a common ethical basis 
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for stem-cell research. 


When widely embraced, international 
guidelines for professional conduct can be 
more effective than laws and regulations. The 
latter are confined to single jurisdictions, can 
be blunt regulatory instruments and change 
too slowly to keep pace with cutting-edge 
research. In highly politicized areas of science, 
legislated regulatory frameworks are at risk of 
being revised with every change in govern- 
ment, creating unpredictability that wastes 
resources and frustrates medical advances. 

Guidelines are not binding, but countries, 
funders, journals and academic institutions 
can incorporate them into their policies or 
use them to foster a culture of compliance. 
Backed by evidence and sound reasoning, 
they can even provide evidence of profes- 
sional standards to courts'’. Ultimately, 
guidelines give voice to norms and expec- 
tations regarding preclinical evidence, trial 
design and independent review, providing 
a structured basis for adjudicating disputes. 

The ISSCR guidelines continue the tradi- 
tion of scientists creating professional stand- 
ards for the responsible conduct of research. 
They speak most directly to those engaged 
in stem-cell research but are also relevant to 
regulators, journal editors, press officers, phy- 
sicians, funding bodies and patients. Such a 
global effort to establish research standards 
offers a model for other contentious research 
arenas — from artificial intelligence to climate 
engineering. 
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Driverless buses are being tested by an innovation consortium in Kista, Sweden. 


Twelve principles for 
open innovation 2.0 


Evolve governance structures, practices and metrics to accelerate innovation 
in an era of digital connectivity, writes Martin Curley. 


new mode of innovation is emerging 
At blurs the lines between univer- 

sities, industry, governments and 
communities. It exploits disruptive tech- 
nologies — such as cloud computing, the 
Internet of Things and big data — to solve 
societal challenges sustainably and profit- 
ably, and more quickly and ably than before. 
It is called open innovation 2.0 (ref. 1). 

The promise is sustainable, intelligent 
living: innovations drive economic growth 
and improve quality of life while reducing 
environmental impact and resource use. For 
example, a dynamic congestion-charging 
system can adjust traffic flow and offer 
incentives to use park-and-ride schemes, 
guided by real-time traffic levels and air 
quality. Car-to-car communication could 
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manage traffic to minimize transit times 
and emissions and eliminate road deaths 
from collisions. Smart electricity grids lower 
costs, integrate renewable energies and bal- 
ance loads. Health-care monitoring enables 
early interventions, improving life quality 
and reducing care costs. 

Such innovations are being tested in 
‘living labs’ in hundreds of cities. In Dublin, 
for example, the city council has partnered 
with my company, the technology firm 
Intel (of which I am a vice-president), to 
install a pilot network of sensors to improve 
flood management by measuring local rain 
fall and river levels, and detecting blocked 
drains. Eindhoven in the Netherlands is 
working with electronics firm Philips and 
others to develop intelligent street lighting. 
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Communications-technology firm Ericsson, 
the KTH Royal Institute of Technology, 
IBM and others are collaborating to test 
self-driving buses in Kista, Sweden. 

Yet many institutions and companies 
remain unaware of this radical shift. They 
often confuse invention and innovation. 
Invention is the creation of a technology or 
method. Innovation concerns the use of that 
technology or method to create value. The 
agile approaches needed for open innovation 
2.0 conflict with the ‘command and control 
organizations of the industrial age (see ‘How 
innovation modes have evolved’). Institu- 
tional or societal cultures can inhibit user and 
citizen involvement. Intellectual-property 
(IP) models may inhibit collaboration. Gov- 
ernment funders can stifle the emergence 
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of ideas by requiring that detailed descrip- 
tions of proposed work are specified before 
research can begin. Measures of success, such 
as citations, discount innovation and impact. 
Policymaking lags behind the market place. 

The challenge is how to execute and gov- 
ern the new mode. Innovation is a risky 
business that has high failure rates — 96% 
of all innovations do not return their capital 
cost, and 66% of new products fail within 
two years. But the potential benefits are vast. 
Innovation policies should recognize that the 
linear research-and-development model will 
be outpaced by a nonlinear, open and collab- 
orative innovation process where the mantra 
is ‘fail fast, scale fast’ 

Awareness of open “The 
innovation 2.0needstobe mantra 
raised across industry and is ‘fail 
society. Here I outline the fast, scale 
concept and how it works, fast’.” 
and offer a set of ‘design 
patterns’ — general solutions to common 
problems. Adopting these can accelerate 
the move from conceptual to concrete. The 
European Union's Open Innovation Strategy 
and Policy Group, of which I am a chair, is 
a global leader in distilling its knowledge in 
this way. The goal is that open innovation 2.0 
will become a discipline practised by many 
rather than an art mastered by few. 


DIVERSITY COUNTS 

Technical innovation is no longer solely the 
domain of lone scientists in labs. Scientists 
in the mid-twentieth century at Bell Labs in 
Murray Hill, New Jersey, are credited with 
inventing the transistor, the laser and the 
Unix computer operating system. Today, a 
more than one-third of the product launches 
by consumer-goods firm Procter & Gamble 
— including the fabric softener Bounce — 
emanate from ideas that started outside the 
company. 

The term open innovation — where ideas 
pass between different organizations to cre- 
ate value — was coined by organizational 
theorist Henry Chesbrough’ in 2003. Today, 
the concept is evolving fast. Driven by plum- 
meting communication costs and the ever 
increasing numbers of connected people and 
devices, it has never been so easy to exchange 
information and ideas. 

For example, InnoCentive, founded by 
US drug company Eli Lilly, is a website that 
matches problems posted by companies with 
scientists who can potentially solve them. So 
far, more than 59,000 solutions have been 
proposed in response to about 2,000 chal- 
lenges with US$48 million paid to solvers. 
The GreenTouch consortium, led by telecom- 
munications company Alcatel-Lucent, set a 
goal of community-sourcing ways to improve 
the energy efficiency of communication net- 
works by a factor of 1,000 by 2020. It delivered 
a road map for a factor of 10,000. Andthe > 


COMMENT 


TWELVE PATTERNS 


Keys to collaborative innovation 


| Purpose. Efforts and intellects 
aligned through commitment rather 

than compliance deliver an impact greater 
than the sum of their parts. A great example 
is former US President John F. Kennedy’s 
vision of putting a man on the Moon. 
Articulating a shared value‘ that can be 
created is important. A win-win scenario is 
more sustainable than a win-lose outcome. 


Partner. The ‘quadruple helix’ of 

government, industry, academia and 
citizens joining forces aligns goals, 
amplifies resources, attenuates risk and 
accelerates progress. A collaboration 
between Intel, University College London, 
Imperial College London and Innovate UK’s 
Future Cities Catapult is working in the Intel 
Collaborative Research Institute to improve 
people’s well-being In cities, for example to 
enable reduction of air pollution. 


Platform. An environment for 

collaboration is a basic 
requirement. Platforms should be 
integrated and modular, allowing a 
plug-and-play approach. They must be 
open to ensure low barriers to use, 
catalysing the evolution of a community. 
Challenges in security, standards, trust and 
privacy need to be addressed. For 
example, the Open Connectivity 
Foundation is securing interoperability for 
the Internet of Things. 


4 Possibilities. Returns may not come 
from a product but from the business 

model that enabled It, a better process ora 
new user experience. Strategic tools are 
available, such as industrial designer Larry 
Keeley’s breakdown of innovations into ten 
types in four categories: finance, process, 
offerings and delivery’. 


5 Plan. Adoption and scale should be 
the focus of innovation efforts, not 

product creation. Around 20% of value is 
created when an innovation is established; 
more than 80% comes when it is widely 
adopted’. Focus on the ‘four Us’: utility 
(value to the user); usability; user 
experience; and ubiquity (designing In 
network effects). 


5 Pyramid. Enable users to drive 
innovation®. They inspired two- 
thirds of innovations in semiconductors 
and printed circuit boards, for example. 
Lego Ideas encourages children and others 
to submit product proposals — submitters 
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must get 10,000 supporters for their idea 
to be reviewed. Successful inventors get 
1% of royalties. 


Problem. Most innovations come 

from a stated need. Ethnographic 
research with users, customers or the 
environment can identify problems and 
support brainstorming of solutions. Create 
a road map to ensure the shortest path to a 
solution. 


8 Prototype. Solutions need to be 
tested and improved through rapid 

experimentation with users and citizens. 
Prototyping shows how applicable a 
solution is, reduces the risks of failures and 
can reveal pain points. ‘Hackathons’, where 
developers come together to rapidly try 
things, are increasingly common. 


g Pilot. Projects need to be 

implemented in the real world on 
small scales first. The Intel Collaborative 
Research Institute runs research projects in 
London’s parks, neighbourhoods and 
schools. Barcelona’s Laboratori — which 
involves the quadruple helix — is 
pioneering open ‘living lab’ methods in the 
city to boost culture, knowledge, creativity 
and innovation. 


10 Product. Prototypes need to be 
converted into viable commercial 
products or services through scaling up and 
new infrastructure globally. Cloud 
computing allows even small start-ups to 
scale with volume, velocity and resilience. 


11 Product service systems. 
Organizations need to move from 

just delivering products to also delivering 
related services that improve sustainability 
as well as profitability. Rolls-Royce sells 
‘power by the hour’ — hours of flight time 
rather than jet engines — enabled by 
advanced telemetry. The ultimate goal of 
open innovation 2.0 is a circular or 
performance economy’, focused on 
services and reuse rather than 
consumption and waste. 


12 Process. Innovation is a team sport. 
Organizations, ecosystems and 
communities should measure, manage and 

improve their innovation processes to 
deliver results that are predictable, 
probable and profitable. Agile methods 
supported by automation shorten the time 
from idea to implementation. 


19 MAY 2016 | VOL 533 | NATURE | 315 


>» Innovation Value Institute at Maynooth 
University in Ireland worked internationally 
and with competing companies to develop a 
framework for measuring and improving IT 
capability, which is now used by hundreds of 
organizations worldwide. 

Companies are opening up their research 
labs. Philips has converted its research facil- 
ity in Eindhoven, which had 2,400 employ- 
ees in 2001, to an open research campus 
(High Tech Campus Eindhoven) that now 
houses more than 140 firms and around 
10,000 researchers. Breakthrough ideas 
often emerge at the intersection of disci- 
plines. For example, Keenan, an Irish agri- 
cultural-equipment supplier, and telephone 
company Vodafone have worked with Intel 
to develop an online service that uses real- 
time information to provide farmers with 
nutritional advice for livestock. 

Open innovation 2.0 is neither easy nor is it 
a panacea. It requires courage and energy. But 
once a critical mass is achieved, innovation 
can catalyse itself’. Just as momentum is the 
product of mass and velocity, the ecosystem 
with the most participants and fastest turno- 
ver of ideas will be the most successful. Par- 
ticipating organizations must create synergies 
rather than cancel each other out. High levels 
of trust and conviction in the shared vision are 
predictors for eventual success. 

A common language helps. Just as 
architects and engineers can refer to canon- 
ical designs when building a bridge, social 
and technological innovators can improve 
productivity by following design patterns. 
These heuristics summarize insights about 
the innovation process and can be combined 
(see ‘Keys to collaborative innovation ). They 
shorten learning times and improve the 
results and pace of innovation. 


NEXT STEPS 

We have all witnessed how the music and 
book industries have been transformed 
by companies such as Apple and Amazon 
through digitization. Transforming cities, 


Intelligent-lighting sensors can collect useful 
data about urban activities for city planners. 
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HOW INNOVATION MODES HAVE EVOLVED 
Closed innovation 
Dependency 
Subcontracting 
Solo Bilateral 
Linear 

Linear subcontracts Bilateral 
Planning 
Control 


Management 


Win-lose game 


Box thinking Out of the box 
Single entity Single discipline 
Value chain Value network 


energy grids and health-care systems will 
be harder. It will need technology break- 
throughs, alignment of interests, investment 
and collaboration across many stakeholders. 

Different sectors should explore how dis- 
ruptive technologies can transform their 
domain. For example in health care, wirelessly 
transmitting information from electrocardio- 
grams (a heart-activity test), and using cloud- 
based analytics, could detect the early onset of 
conditions such as cardiac fibrillation. 

Governments should create innovation 
strategies that build capacity and focus efforts 
on nationally important problems. Road 
maps that set out a sequence of problems to be 
solved and the innovations required — such 
as the International Technology Roadmap 
for Semiconductors — can direct efforts effi- 
ciently. Imagine the power of a European or 
globally agreed road map for smarter cities, 
more sustainable electrical grids or health- 
care transformation. The risks of collabo- 
ration can be reduced by publishing model 
contracts for public-private partnerships. 
Measurement instruments such as Europe's 
Innovation Union Scoreboard can identify 
which areas of a country’s innovation ecosys- 
tem need to be strengthened. 

The EU should expand its ambition 
beyond creating a European Research Area 
to nurturing a European Innovation Ecosys- 
tem. It is promising that a priority for the 
current Dutch presidency of the EU is ensur- 
ing that all European funding instruments 
focus more on innovation. EU research com- 
missioner Carlos Moedas proposed that the 
establishment of a European Innovation 
Council (EIC) would be an important inter- 
vention. The EIC could be the steward of an 
overall European innovation strategy and of 
societal-challenge road maps. 

Governments should encourage the adop- 
tion of innovations as well as their creation. 
They should modulate spending, incen- 
tives and policy depending on technology 
maturity. Immature technologies with high 
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Open innovation 


Independency 


Cross-licensing 


Linear, leaking 


Validation, pilots 


Win-win game 


Open innovation 2.0 
Interdependency 
Cross-fertilization 
Ecosystem 

Nonlinear mash-up 
Triple or quadruple helix 
Experimentation 
Orchestration 

Win more-win more 

No boxes! 


Interdisciplinary 


Value constellation 


potential need research and pilot projects 
funded; more-proven prototypes require 
support for capital deployment cost as well 
as incentives such as tax credits or loan guar- 
antees. For example, in the United States, 
incentive payments were used to encourage 
physicians to adopt electronic health records. 
Leaders should note the public's increasing 
desire to be involved in innovation efforts, 
as exemplified by citizen-science initiatives. 

A cultural shift is at the core of open 
innovation 2.0. The technology is ready — 
are we? 


Martin Curley is vice-president at Intel 
Corporation and director of Intel Labs 
Europe, based in Dublin, Ireland. 
e-mail: martin.g.curley@intel.com 


1. Curley, M. & Salmelin, B. Open Innovation 2.0 —A 
New Paradigm (EU Open Innovation and Strategy 
Policy Group, 2013). 

2. Chesbrough, H. Open Innovation: The New 
Growth for Creating and Profiting from Technology 
(Harvard Business School Press, 2003). 

3. Curley, M. & Formica, P. (eds) The Experimental 
Nature of New Venture Creation: Capitalizing on 
Open Innovation 2.0 (Springer, 2013). 

4. Porter, M. E. & Kramer M. R. ‘Creating Shared 
Value’ Harvard Bus. Rev. (January-February 2011). 

5. Gawer, A. (ed) Platforms, Markets and Innovation 
(Edward Elgar, 2011). 

6. Keeley, L., Walters, H., Pikkel, R. & Quinn, B. Ten 
Types of Innovation: The Discipline of Building 
Breakthroughs (Wiley, 2013). 

7. Rogers, E. M Diffusion of Innovations 5th edn 
(Simon and Schuster, 2003). 

8. Von Hippel, E. The Sources of Innovation (Oxford 
University Press, 1988). 

9. Stahel, W. The Performance Economy 2nd edn 
(Palgrave Macmillan, 2010). 


CORRECTION 

The Comment ‘Seven chemical 
separations to change the world’ 

(D. S. Sholl and R. P. Lively Nature 532, 
435-437; 2016) gave the incorrect units 
for atmospheric distillation. It should have 
read 230 GW globally. 
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ECOLOGY 


The sea-otter whisperer 


Jane Lubchenco applauds James Estes’s chronicle of his 45 years studying the 
complexities of an apex predator. 


he sea otter (Enhydra lutris) has hada 

turbulent history. A cultural icon for 

tribes living around the coast of the 
North Pacific Ocean, it became an irresist- 
ible target for fur traders who almost drove 
it to extinction, triggering an international 
hunting ban in 1911. Then, as the species 
began to recover in parts of its former range, 
it became a magnet for tourists, a symbol of 
hope for marine conservation and the equiv- 
alent of Darwin's finches for one scientist: 
ecologist James Estes. Sea otters are now in 
decline again in most areas. Estes has relent- 
lessly unpacked insights into the species’ 
history and ecological complexities for more 
than four decades, in one of the most remote 
places on Earth. 

In Serendipity, Estes chronicles that 
research — in Alaska’s Aleutian Islands and 
beyond — on this relatively small marine 
mammal, its voracious appetite for crabs, 
clams, fish and especially sea urchins, and 
its consequent ability to control the very 
presence or absence of entire kelp forests 
and their inhabitants. 

In his refreshingly honest and humble 
narrative, Estes shares the intriguing insights 
that he gained through careful observation, 
intentional perturbations of environmental 
conditions, long-term monitoring, rigorous 
analysis, willingness to collaborate and open- 
ness to new ideas. He acknowledges the often 
circuitous pathways and serendipitous events 
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that yielded new ideas to test. For example, 
he relates a pivotal conversation with ecolo- 
gist Robert Paine, who coined the concept of 
keystone species, which have a dispropor- 
tionately large effect on their ecosystems (see 
Nature 493, 286-289; 2013). That led Estes 
to pose what would become the defining 
question of his career — “What impact do sea 
otters have on kelp forests?’ — rather than his 
initial query, “What impact do kelp forests 
have on sea otters?’ 

The study of top-down control of an eco- 
system by an apex predator and the use of 
perturbations to test ecological hypotheses 
were in their infancy in the 1970s, when Estes 
began his Alaska studies. The Aleutian otters’ 
ongoing recovery from near-extinction 
provided the perfect 
mosaic of conditions: [ ~~ == 
in some places, there | SERENDIP] Ty 
were many Otters; 1 alias 
others, there were | 
few; and in yet others, 
otters were still absent. 
Estes’ early work dem- 
onstrated conclusively 
that otters in Alaska 
did control much of 
the rest of the near- 
shore community by 
eating sea urchins, the 
primary consumers of 
kelps, and controlling 


Serendipity: An 
Ecologist’s Quest 
to Understand 
Nature 

JAMES A. ESTES 
University of California 
Press: 2016. 
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their abundance. Where otters are present, so 
too are kelp forests and the plethora of other 
species (fish, crabs, starfish, mussels, snails, 
seals, seaweeds, birds and more) that depend 
on the plants for habitat or food. Where otters 
are absent, urchins thrive, gobble up kelp and 
keep the seascape barren. 

Estes and his colleagues went on to dem- 
onstrate multiple aspects of ‘trophic cascades, 
in which an apex predator's influence ripples 
through the food web through complex, indi- 
rect interactions. Thanks to the team’s work, 
we know that the diets of gulls and bald eagles, 
the abundance of fish, the growth of mussels 
and the size and abundance of starfish are 
all strongly influenced by the presence or 
absence of otters in Aleutian kelp forests. 

Some of the most intriguing ideas that 
emerged from Estes’s studies focus on the 
surprising and precipitous decline of Aleu- 
tian sea otters that began in the early 1990s. 
Observations of increasing numbers of 
killer whales, or orcas (Orcinus orca), and 
of killer whales preying on otters, coupled 
with calculations about otter demography 
and killer-whale energetics, led to a startling 
suggestion by Estes and his colleagues: preda- 
tion by killer whales was driving down otter 
populations. The team’s subsequent hypoth- 
esis was even more surprising. 

Realizing that something must have 
changed in the open sea to drive killer whales 
from their normal oceanic habitats to coastal 
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ones, Estes and his colleagues began to con- 
sider broader-scale events in space and time. 
They knew that after the Second World War, 
intense industrial whaling in the North 
Pacific Ocean had reduced whale biomass 
‘sevenfold to eightfold; up until implementa- 
tion of the international ban on commercial 
whaling in the late 1960s and early 1970s. 
They reasoned that if killer whales had pre- 
viously fed on great whales, the demise of the 
latter might have forced killer whales to for- 
age more widely and switch to alternate prey. 

Piecing together data from the seas up to 
370 kilometres around the Aleutian archi- 
pelago, they presented a compelling hypoth- 
esis of sequential collapses that spanned 
four decades. First, great-whale populations 
plummeted because of whaling. Then, killer 
whales switched to and decimated first one 
species, and then another, causing the col- 
lapse of populations of harbour seals (Phoca 
vitulina), then Steller sea lions (Eumetopias 
jubatus) and finally sea otters. This ‘mega- 
faunal collapse hypothesis, articulated by 
Estes, Alan Springer and their colleagues, was 
not received warmly by some scientists who 
had been studying individual species, but 
evidence for it continues to mount. Moreo- 
ver, these ideas raise more intriguing ques- 
tions: as at least some great whales recover 
from decimation by whaling, what will killer 
whales prey on? And what will then happen 
to seals, sea lions, otters and kelp-forest com- 
munities? Let us hope that Estes will continue 
to provide insights. 

Many of the findings in the book, espe- 
cially the early ones, are classics of ecology. 
For non-specialists, Estes’s account is a 
fascinating introduction to key ecological 

concepts. For the 


“This relatively cognoscenti and 


small marine for students seek- 
mammal has ing understand- 
an ability to ing of how science 
controlthevery 2¢tually works, 
presence or Serendipity is an 
absence of entre honsibehind the 
kelp forests. ell een 


the Aleutians and 
the Pacific more broadly, and what a now- 
esteemed scientist was thinking and feeling 
along the way. A rare and delightful insight 
into timely science. m 


Jane Lubchenco is a distinguished professor 
at Oregon State University in Corvallis. A 
marine ecologist and environmental scientist, 
she was administrator of the US National 
Oceanic and Atmospheric Administration 
from 2009 to 2013, and the first US Science 
Envoy for the Ocean from 2014 to 2016. 
e-mail: lubchenco@oregonstate.edu 

The views expressed are solely those of the author 


and not necessarily those of the US Department 
of State. 
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Books in brief 


What We Cannot Know: Explorations at the Edge of Knowledge 
Marcus du Sautoy FOURTH ESTATE (2016) 

Science bristles with ‘known unknowns’, from dark matter to the 
origins of life. But are some of these conundrums unknowable? In 
this finely synthesized study, mathematician Marcus du Sautoy 
explores the edges of our understanding in maths, mind and 
beyond. Each spiralling investigation begins with an object: casino 
dice kick-start a foray into probability; a wristwatch propels us into 
grappling with time. A dazzling journey, vivified by conversations 
with the likes of neuroscientist Christof Koch on psychophysics and 
cosmologist Max Tegmark on the mathematical Universe. 


The New Worlds of Thomas Robert Malthus: Rereading the 
Principle of Population 

Alison Bashford and Joyce E. Chaplin PRINCETON UNIVERSITY PRESS (2016) 
“For more than two hundred years, people have loved to hate 
Thomas Robert Malthus,” declare historians Alison Bashford and 
Joyce Chaplin. In their penetrating reappraisal of the philosopher’s 
Essay on the Principle of Population, first published in 1798, they 
argue that his theory of population growth outstripping agricultural 
production has been decontextualized. He was less a critic of the 
English poor than of a swelling European population in his era of 
colonization, and its impact on people in the New World. 


The Five Horsemen of the Modern World: Climate, Food, Water, 
Disease, and Obesity 

Daniel Callahan COLUMBIA UNIVERSITY PRESS (2016) 

In this trenchant treatise, bioethicist Daniel Callahan investigates 
why five global issues — water and food shortages, climate change, 
disease and obesity — remain intractable, despite the billions of 
dollars thrown at them. Callahan examines cross-cutting problems 
such as an ageing population, and analyses the role of policy and 
governance. To ramp up the “intensity level” needed for solutions, he 
envisions coordinated efforts by grass-roots activists, corporations 
and national and international agencies. Pragmatic and measured. 


Age of Discovery: Navigating the Risks and Rewards of Our 

New Renaissance 

lan Goldin and Chris Kutarna BLOOMSBURY INFORMATION (2016) 

The grit and glory of the Renaissance is mirrored in our own age, 
from looming risks such as pandemics to the transcendent genius 
needed to overcome them. So aver globalization expert lan Goldin 
and political scientist Chris Kutarna in this bold mega-analysis 

of global education, health, prosperity and technology, then and 
now. Their parsing of revolutions — from the introduction of the 
Copernican principle to the digital explosion — is incisive and rich in 
context and granularity, but not every sweeping statement holds up. 


In the Bonesetter’s Waiting Room: Travels Through Indian Medicine 
Aarathi Prasad PROFILE (2016) 

Indian medicine is a complex landscape where six traditional 
systems jostle with Western biomedicine. Science journalist Aarathi 
Prasad’s focused and fascinating journey through the intricacies 
takes her to Mumbai’s sprawling slum Dharavi, where sanghinis, or 
female counsellors, help the hordes of local women subjected to rape 
and domestic violence. She also visits a clinic that integrates Western 
labwork with ancient Ayurvedic practice; a project mixing paediatric 
eye surgery with neuroscience research; and much more. Barbara Kiser 
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The San Francisco Museum of Modern Art’s extension more than doubles its exhibition space. 
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California on camera 


John Gilbey lauds San Francisco’s vastly expanded 
showcase for modern art and photography. 


mid-twentieth-century science- 
Aizen writer’s image of the City of 

the Future: that is how the approach 
to the San Francisco Museum of Modern 
Art (SEFMOMA) in California can seem. 
On a clear spring afternoon, the black and 
white stone oculus of the Mario Botta- 
designed building emerges from the foot- 
hills of the Financial District like the prow 
of a grounded starship. Now, a new addition 
more than doubles SEMOMAs exhibition 
space. The expansion has allowed SDMOMA 
to remodel public spaces, conservation 
resources and study areas for a new genera- 
tion of researchers and visitors. 

SFMOMA has had a strong track record 
in covering the diversity of Bay Area tech- 
nological innovation and artistic richness. 
The exhibition 010101: Art in Technologi- 
cal Times, launched in 2001, featured sound 
installations by musician Brian Eno, as well as 
video and live digital landscapes — ephem- 
eral work that also brought curatorial chal- 
lenges. In 2014, the museum showcased geek 
power at its Art + Data Day, a hackathon that 
challenged collaborative teams to interro- 
gate the museum collection metadata. Now, 
with the expansion, SFMOMA boasts the 
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Reopening 
San Francisco Museum of Modern Art, California. 
14 May. 


California and the West 
San Francisco Museum of Modern Art, California. 
14 May— 5 September. 


Pritzker Center for Photography, which offers 
1,400 square metres to display the museum's 
renowned photography holdings, including 
iconic works by nineteenth- and twentieth- 
century greats such as Carleton Watkins and 
Ansel Adams. It is now the largest US gallery 
space dedicated to the technology. 

White and multifaceted, the wing’s rip- 
pled fibreglass-reinforced polymer cladding 
— designed to echo the Bay Area’s famous 
fogs — has a dynamic look in the chang- 
ing light. The transdisciplinary approach 
of its innovative architects Snghetta inte- 
grates architectural, landscape and interior 
design; their portfolio includes the Biblio- 
theca Alexandrina in Alexandria, Egypt. The 
edifice has been costly both financially — at 
US$305 million — and practically, because 
the museum building has been closed since 
June 2013. Until recently, red hoardings 
enjoined unlucky visitors to “Think Outside 
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the Building” — a reference to its successful 
touring outreach programme, SFMOMA On 
The Go. 

Inside, the ample room for photography 
displays gives the museum's 17,000-piece 
collection real breathing space. SFMOMA 
has been amassing the holdings, which 
range from early Daguerreotypes to cutting- 
edge digital images, since it opened in 1935. 
This permanent collection is categorized by 
themes that speak to the unique geographic 
and cultural position of the city. The local 
heroes of the Group f/64 collective — Adams, 
Edward Weston, Imogen Cunningham — are 
inevitably strongly represented. But the col- 
lection of Japanese photography, perhaps 
the finest outside Japan, demonstrates how 
San Francisco considers itself to be a Pacific 
community. Modernist and Surrealist pho- 
tographers such as Alfred Stieglitz and Man 
Ray also feature. 

Senior curator of photography Sandra Phil- 
lips talked to me about the technology's spe- 
cial place in San Francisco — blessed with a 
photogenic mix of topography, microclimate 
and infrastructure — and in California, a state 
endowed with rare geological and ecological 
richness. Pioneering regional photographers 
such as Watkins and Adams created a legacy 
of iconic monochromes. The expansion, Phil- 
lips told me, allows for more study as well as 
display of this deep resource: the Photography 
Interpretive Center, for instance, enables visi- 
tors to explore the lives and work of photog- 
raphers through video interviews and films. 

Phillips is curator of California and the 
West, an exhibition that traces the portrayal 
of Californian wilderness and land develop- 
ment in photographs over time. The show, 
she notes, “will reveal the evolution in our 
understanding of land use, first as develop- 
ment in mining and logging, then as protec- 
tion”. Along with the work of Robert Adams, 
who as part of the 1970s New Topographics 
movement revealed the regional realities of 
urban encroachment, there will be images 
by early masters — Dorothea Lange, Minor 
White and Group f/64. A more contempo- 
rary lens on the landscape is provided by 
Richard Misrach, Larry Sultan, Michael 
Light and Trevor Paglen. 

The expansion offers SFMOMA a way to 
redefine the role of a modern art museum 
in a city poised between a vast, sometimes 
invasive, high-tech culture and an older, 
perhaps simpler, tradition in a technological 
medium. And by presenting itself'as a public, 
eclectic resource offering free entry in perpe- 
tuity to under- 18s, it is setting off beneficial 
ripples in local culture. Today’s young visi- 
tors are the digital natives who will shape the 
future of both technology and the arts. = 


John Gilbey is a science and science-fiction 
writer and photographer. 
Twitter: @John_Gilbey 
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Prevent misuse of 
eco-compensation 


China is piloting a new type of 
ecological compensation scheme 
in which developers pay to offset 
environmental damage. We are 
concerned that this policy could 
have the opposite effect. 

Ecological compensation by 
developers in northwest Europe 
must restore natural habitats 
and resources to maintain 
ecological functions and services 
(see E. Wolanski and M. Elliott 
Estuarine Ecohydrology; Elsevier, 
2015). By contrast, ecological 
restoration is not a priority under 
Chinas pilot scheme. Developers 
may choose to pay government 
authorities directly rather than 
opting to restore ecosystems, 
which is generally more 
expensive, time-consuming and 
scientifically demanding. 

We are concerned that the 
payment system could be 
misused by treating it as ‘buying’ 
the right to inflict further 
damage. We therefore urge 
the government to develop an 
ecological compensation policy 
that is conditional on ecological 
restoration. 

Qinhua Fang Xiamen University, 
China. 

Michael Elliott University of 
Hull, UK. 

ghfang@xmu.edu.cn 


Speed publication of 
self-corrections 


Kudos to Daniele Fanelli for 
suggesting that authors deserve 
credit for retracting papers 

that result from honest errors 
(Nature 531, 415; 2016). Journal 
editors and co-authors can 

be helpful, collaborative and 
efficient — as they were in the 
case of two papers I retracted 
in 2013 (see go.nature.com/ 
jp9meh). However, this is not 
always the case. 

The editorial process can 
often be lengthy. After waiting 
for more than two years for the 
formal resolution of a potential 
notice of error for another, 


related paper, I decided to use 
PubMed Commons to alert 
the scientific community (see 
go.nature.com/jzpjfk). 

Authors themselves can be 
uncertain about how to correct 
the scientific record promptly, 
even when they suspect errors. 
They might have moved on to 
new projects, key contributors 
may have left the lab, or perhaps 
they are reluctant to use precious 
funds to repeat old experiments. 
Maybe the materials that are 
needed to repeat the experiments 
are no longer available, or the 
authors are slow to reach a 
consensus on appropriate action. 

In my view, these problems 
could be rectified by standard 
guidelines for researchers, editors 
and databases on how to handle 
self-reported amendments. 
Terms that are more nuanced 
than ‘correction or ‘retraction 
— such as ‘notice of concern’ or 
‘error alert’ — could be used for 
situations that are less clear-cut. 
Pamela Ronald University of 
California, Davis, USA. 
pcronald@ucdavis.edu 


EU list should add 
potential invasives 


To be effective, the list of invasive 
alien species that are targeted for 
action under the latest European 
Union regulation needs to focus 
on preventing new species from 
entering Europe, and to impede 
the spread of those that currently 
have a limited distribution (see 
also J. Pergl et al. Nature 531, 
173; 2016, and F. Santarém et al. 
Nature 532, 177; 2016). 

To be listed under the 
regulation, species must have 
been fully risk-assessed. However, 
existing assessments are mainly 
for species that are already 
widespread. All but 2 of the 37 
invasive species on the 2015 EU 
list (see go.nature.com/gigftz) 
are present in Europe, and 14 are 
widely dispersed. Of the 95 species 
that have been identified 
as potentially invasive (see 
go.nature.com/avgfb1), only the 
South American raccoon (Nasua 


nasua) is on the present list. 

In our view, post-invasion 
management is likely to be more 
successful and cost-effective if 
the legislation’s ‘black list’ were to 
focus on potential future invaders 
as well as on mitigating the 
impacts of existing invaders. 
Maiju Lehtiniemi* Finnish 
Environment Institute, Helsinki, 
Finland. 
maiju.lehtiniemi@ymparisto.fi 
*On behalf of 4 correspondents (see 
go.nature.com/jbvxnz for full list). 


Nicaragua Canal may 
not benefit shipping 


We contend that the benefits of 
the proposed Nicaragua Canal 
cannot justify the irreversible 
damage that it would cause to the 
environment (see A. Meyer and 
J. A. Huete-Pérez Nature 506, 
287-289; 2014). 

With the Panama Canal’s 
widening due to be completed 
in the next month or so, the 
argument that the Nicaragua 
Canal could better accommodate 
large container ships is no 
longer valid. Such vessels are of 
marginal significance to trans- 
Pacific shipping anyway, owing 
to insufficient freight demand 
and port limitations. And as 
manufacturing bases shift from 
China to south and southeast 
Asia, more of Asias outbound 
container ships use the Suez 
Canal to reach eastern US ports. 

The Nicaragua Canal would 
be much longer than the 
Panama Canal, and so would 
probably incur higher fees for 
bulk carriers, tankers and other 
vessels. On routes that link North 
America with Asia, large vessels 
already tend to sail around South 
Africas Cape of Good Hope to 
avoid canal fees. 

The Nicaraguan government 
has also underestimated the safety 
risks. The country experiences 
frequent earthquakes, hurricanes 
and volcanic eruptions — which 
led the United States to select 
Panama over Nicaragua fora 
transoceanic canal in the first 
place. 


Jihong Chen Shanghai Maritime 
University, China. 

Xiang Liu Rutgers University, 
Piscataway, New Jersey, USA. 
jhchen@shmtu.edu.cn 


Clean up the air 
in Kathmandu 


The rebuilding and development 
of Kathmandu’s infrastructure 
after last year’s earthquake in 
Nepal offer a unique opportunity 
to improve the city’s air quality. 
According to one ranking, 
Kathmandu is currently Asia’s 
most polluted city (see go.nature. 
com/csbtlc). 

Kathmandu’ air pollution 
is increasing uncontrollably 
owing to a lack of environmental 
legislation in the face of 
haphazard expansion. The 
number of vehicles has tripled 
over the past ten years, and toxic 
particulates are now “far above” 
World Health Organization 
guidelines (A. Gurung and 
M.L. Bell J. Expo. Sci. Environ. 
Epidemiol. 22, 235-242; 2012). 
Unregulated industrial activity, 
dust from the congested road 
system and open burning of solid 
waste all add to the pollution. 
Moreover, Kathmandu lies in a 
valley, the topography of which 
causes temperature inversions 
that trap this polluted air — 
making it barely breathable on 
occasions. 

I urge the government 
to extend its clean-up and 
restoration beyond buildings, 
temples and monuments and to 
draw up stringent environmental 
regulations. The valley's 
3 million inhabitants matter, as 
well as its tourists. 
Keshav Parajuly University of 
Southern Denmark, Odense, 
Denmark. 


kepa@kbm.sdu.dk 
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GO Nat OBITUARY 


Harry Elderfield 


(1943-2016) 


Geochemist who deciphered chemical signatures in the modern and ancient oceans. 


enry ‘Harry Elderfield showed how 
H:: distributions of chemicals in 
seawater and sediments can reveal 
the ocean's role in historical climate change. 

Elderfield, who died on 19 April, was 
born in 1943, at the height of the Second 
World War, in North Yorkshire, UK. A 
few days before his birth, his father, Henry, 
was reported missing in action, presumed 
drowned, a loss that may have contributed 
to Elderfield’s draw to the oceans as well as to 
the past. After obtaining a degree in chem- 
istry from the University of Liverpool, UK, 
in 1965, he completed a postdoctoral fellow- 
ship at Imperial College London, in 1969, 
and started a lectureship at the University of 
Leeds, UK, even before completing his PhD. 

In 1977, Elderfield headed to the Uni- 
versity of Rhode Island in Kingston for a 
sabbatical year. There he met geochemists 
Gary Klinkhammer, John Edmond and 
Wallace Broecker. Discussions with these 
scientists would have a major influence on 
the research questions that he later pursued. 

Back at Leeds, Elderfield’s laboratory 
technician, Mervyn Greaves, was told to 
make himself useful in the lab of another 
faculty member, Chris Hawkesworth. 
Hawkesworth had established a way to ana- 
lyse rare-earth metals — a group of 17 ele- 
ments — in 1-gram rock samples. Elderfield’s 
biggest scientific splash came when he and 
Greaves applied a modified version of this 
method to 50-litre samples of seawater and 
made the first measurements of the tiny con- 
centrations of ten rare earths from samples 
collected from the North Atlantic. They were 
present in picomolar amounts (on the order 
of 10°’ moles per kilogram). 

Elderfield established how the composi- 
tion of rare earths in a sample, and the specific 
properties of the elements, could be used to 
identify different water masses — deep water 
bodies from different parts of the ocean that 
have distinct physical properties. He also pre- 
dicted — correctly — that these signatures 
could be extracted from the ocean sediments 
covering the sea floor and used to track how 
ocean circulation has changed over time. (The 
composition of rare earths in the sediment of 
a particular geological age reflects the compo- 
sition of the elements in the ocean water at the 
time that the layer was formed.) 

In 1982, Elderfield moved to St Catharine's 
College at the University of Cambridge, UK. 
Here he was one of the first to realize how 
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isotopes that decay radioactively over time 
could be used to solve marine geochemical 
puzzles. With Martin Palmer, who had been 
a student of Elderfield’s at Leeds, he developed 
the strontium isotope curve for seawater for 
the past 75 million years. Analysing the ratios 
of strontium isotopes taken from sediments 
of known age, the pair mapped the changes 
in the isotopic ratio in the ocean over time. 
This isotopic curve, recorded globally in 
limestone, shows the shifting effects of con- 
tinental weathering and hydrothermal vents. 
These sea-floor features spew spectacular 
plumes of geothermally heated water and 
chemicals into the ocean, and are the major 
suppliers of strontium to the ocean. Stron- 
tium isotope stratigraphy, a term coined by 
Elderfield to describe the mapping of stron- 
tium isotopes in layers of rock, is now widely 
used to date sediments. It has also proved 
useful for climate scientists. Weathering 
pulls carbon dioxide out of the atmosphere 
and vents release the gas, so the strontium 
isotope curve provides a proxy for tectonic 
processes that are crucial to the carbon cycle. 
In 1985, Elderfield’s group participated in 
a cruise that produced the first photographs 
of a hydrothermal vent called the TAG vent 
site (P. A. Rona et al. Nature 321, 33-37; 
1986). This was the first vent discovered in 
the Atlantic Ocean or indeed on any slow- 
spreading mid-ocean ridge. (In subsequent 
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years, Elderfield explored the role of vents 
in ocean chemistry, as well as fluid flow 
through the sea floor.) 

Around the same time, Elderfield and 
Edmond, who then was at the Massachusetts 
Institute of Technology (MIT) in Cambridge, 
arranged sabbaticals in each others labs. But 
as a result of slightly haphazard planning they 
never overlapped in the same place. At MIT, 
Elderfield instead spent much of his time 
with the palaeochemist Edward Boyle, who 
was then trying to relate the chemistry of the 
shells of foraminifera, single-celled organ- 
isms known as protists, to the chemistry of 
the seawater in which they lived. 

In the shell material, foraminiferal calcite, 
trace metals of similar size sometimes replace 
calcium; even trace anions can take the place 
of the carbonate ion. Elderfield pioneered 
the exploitation of these ‘mineralogical mis- 
takes’ to gain insight into past ocean compo- 
sitions. He established, for instance, that the 
temperature of the ocean at the time that the 
foraminifera formed can be inferred from the 
magnesium-calcium ratio in shells. This ratio 
provides a way to distinguish between the 
shifts in oxygen isotopes in the shells caused 
by changes in ocean temperature from those 
caused by changes in the size of Earth’s polar 
ice sheets. Ultimately, Elderfield helped to 
reveal the independent history of changes 
in global temperature from the waxing and 
waning of Earth’s ice sheets over thousands 
or millions of years. 

Harry was a warm, playful, family man. 
He had a mischievous sense of humour and 
a taste for fine art, music, wine and unusual 
spirits. He also had an incredible knack of 
teasing sense out of disorder — or in his 
words, of finding the “really interesting” 
trends in “unascribable scatter”. 

I was one of his PhD students in the 1990s. 
On one tumultuous day I discovered that 
another group had reported findings similar 
to those at the heart of my PhD. Harry dis- 
sipated my fears with one sentence: “It just 
shows you are doing exciting science.’ After 
a pause anda wry smile, he added, “But let's 
do it better”. = 


Rosalind E. M. Rickaby is professor 

of biogeochemistry at the University of 
Oxford, UK. Harry Elderfield was her PhD 
supervisor at the University of Cambridge, 
UK, from 1995 to 1999. 

e-mail: rosalind.rickaby@earth.ox.ac.uk 
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No observable conformational changes in PSII 


ARISING FROM C. Kupitz et a/. Nature 513, 261-265 (2014); doi:10.1038/nature13453 


It is well established! that light flashes produce a succession of oxida- 
tion events in photosystem II (PSII), driving an initially dark-equil- 
ibrated sample (the S state) through a series of states (S, and S3) 
that represent the accumulation of oxidizing equivalents at the 
Mn-containing oxygen-evolving complex (OEC), while electrons are 
transferred to a remote acceptor site. Kupitz et al.” recently claimed 
that significant photoinduced conformational changes are observed 
in PSII using ‘diffract before destroy’ X-ray crystallography tech- 
niques at an X-ray free electron laser (XFEL) source. Although crys- 
tallography experiments can potentially give mechanistic insights into 
PSII-mediated water splitting, the authors’ illumination conditions 
are not suited to move a significant fraction of the system into the 
S3 state, and their claim of conformational differences between the 
S; state and the two-flash sample is not supported by their crystal- 
lographic data. 

Stable charge separation after light absorption is a prerequisite 
for S-state advancement of the OEC. Owing to the slow acceptor- 
side kinetics for the final electron transfer step from quinones 
Qa to Qgz, a delay between flashes on the order of >100 ms is rou- 
tinely used for quantitative formation of the higher S states. Although 
no direct kinetic data are available for the crystals used by 
Kupitz et al.?, rate constants are available for same-species PSII 
preparations**. On the basis of purified protein, the 2101s flash- 
spacing used in the XFEL experiment? can yield at best 20% of 
the centres being populated in the S; state at the moment of the 
XFEL probe, 5701s after the second flash*~°. Alternatively, using 
measured rate constants for intact thylakoids or membrane 
fragments®, at most 23% of the centres have reoxidized Qa 2101s 
after the first light flash and are able to undergo stable charge sepa- 
ration after the second flash to form the S; state. The reported EPR 
data (extended data fig. 1f, g in ref. 2) were recorded with a flash- 
spacing of 1s and therefore do not indicate successful S-state turno- 
ver with the almost 5,000-fold shorter flash-spacing used during the 
XFEL experiments. 

The analysis of the crystallographic electron density is problem- 
atic for several reasons. First, no mF, — DF, difference maps were 
reported, although such maps are the most indicative of small changes 
in the OEC. Second, inspection of the deposited structures (Protein 
Data Bank (PDB) accession codes 4PBU and 4Q54) revealed that the 
authors transposed the two PSII monomers in the two-flash state 
relative to the dark state. As a result, the differences interpreted to 
arise from flash states could have arisen from the differing crystal 
environments between the two monomers. Third, the changes in the 
density surrounding the OEC presented in fig. 3 of ref. 2 are not 
reproducible using most well-validated methods for map calcula- 
tion (with or without bias removal; Fig. 1a). Only when generating 
simulated annealed (SA) omit maps exactly as published’ using 
phenix.autobuild (a program that was never intended to reveal fine 
structural details) were we able to reproduce partially the claimed 
changes in the region of the OEC, but only for one of the two mon- 
omers (Fig. 1b). However, these maps are very sensitive to uncer- 
tainties in the atomic model used for phasing, and inspecting a 
slightly larger area of the SA-omit map produced by the protocol in 
ref. 2 reveals numerous areas with similar differences between the 
light and dark states (ellipses in Fig. 2). If these were true structural 
changes, one would conclude that the entire protein backbone is 
modified by illumination. However, the most likely interpretation 


is that the differences are below the significance level of the experi- 
ment, given the limited resolution (5.0 and 5.5 A) of the diffraction 
data. 

In summary, femtosecond X-ray crystallography at XFEL sources 
promises to provide great insights into complex biological systems such 
as PSI. However, every care has to be taken to ensure that the correct 
states are being studied and that the interpretation is consistent with 
the level of uncertainty in the data. These requirements are not fulfilled 
in the work by Kupitz et al.” 


Monomer 2 


Monomer 1 


Dark (4PBU) 


Two-flash (4Q54/4RVY) 


Ss 


Dark (4PBU) 


Two-flash (4Q54/4RVY) 


Figure 1 | Reanalysis of the electron density from ref. 2 data. 
a, Simulated annealing mF, — DF, omit maps generated by phenix.refine 
with the OEC set to zero occupancy and surrounding atoms restrained. 
This is the classical protocol for producing SA-omit maps’!. Green and 

red mesh denotes +3.00; cyan mesh denotes +6.00; in the dark (PDB 

code 4PBU) and two-flash (PDB code 4Q54 with corrected monomer 
assignment, equivalent to PDB code 4RVY) states. No change of the 
density around Mn4 (indicated by an orange arrow) is visible when 
comparing dark and two-flash data. b, The phenix.autobuild SA-omit maps 
recalculated following the protocol of ref. 2, contoured at 1.50. Orange 
arrows indicate the claimed conformational change. No changes are visible 
in monomer 2. Purple spheres denote Mn; white sphere denotes Ca. 
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Figure 2 | Electron density variability between the two data sets. View 
of a larger area around the OEC in monomer 1, showing phenix.autobuild 
SA-omit maps contoured at 1.50 as in Fig. 1b (blue denotes 4PBU, 

purple denotes 4Q54 with corrected monomer assignment as in 4RVY). 
Orange ellipses denote structural differences claimed by Kupitz et al.’ 
Red ellipses denote additional map differences throughout the structure, 
showing that the claimed differences at the OEC are within the noise level 
of these electron density maps. 


Methods 


The maximum possible S3-state population was calculated using t)/2(S2—S3) = 70s, 
t1j2(Qa —-Qgz_) =0.54 ms*°, a miss factor of 10%, and 100% quinone occu- 
pancy of the Qs acceptor site. Alternatively, t).fast(Q, —+Qg )=0.2ms and 
t128Slow(Q,_ —-Qz_) = 1.6 ms with a ratio of 0.47:0.53, and an overall fraction of 
centres involved in Q, —+Qgz electron transfer of 0.75 was used®. Relative mono- 
mer placement between states was analysed with phenix.emma’, and the molecules 
placed on the same crystallographic origin using phenix.find_alt_orig_sym_mate®. 
This led to the conclusion that the monomers in the two-flash state coordinate file 
(PDB code 4Q54) are swapped relative to the dark state file (PDB code 4PBU). 
After bringing this to the authors’ attention, they deposited an updated coordinate 
file for the two-flash state (PDB code 4RVY) correcting the transposition of mono- 
mer labels. SA-omit maps were calculated using the atomic coordinates and experi- 
mental data deposited in the PDB for accessions 4PBU and 4Q54 (with correctly 
swapped monomer assignment as in 4RVY), with OEC atoms set to zero occupancy 
and using either phenix.refine®'? with surrounding atoms tightly restrained'! 
(Fig. 1a) or phenix.autobuild”° in the composite_omit mode (Figs 1b and 2). 
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FORUM Neuroscience 


Virtual reality explored 


Neuroscientists are increasingly using virtual reality to facilitate studies of animal behaviour, but whether 
behaviour in the virtual world mimics that in real life is a matter for debate. Here, scientists discuss the strengths 


and limitations of the approach. 


THE TOPIC IN BRIEF 

@ Virtual-reality (VR) systems simulate 
real-world inputs to one or more of an 
organism’s sensory neural circuits, then 
measure the subject’s actions and apply 
updates to sensory stimuli in response. 

@ In most rodent set-ups, the animal receives 
visual information from an immersive screen 
that spans its field of vision. The animal’s 
movements control the visual flow, thereby 
replicating the sensory-motor coupling of 
the real world. 


The best of 
both worlds 


MATTHIAS MINDERER 
& CHRISTOPHER D. HARVEY 


Ve. reality is a valuable tool for under- 
standing neural function because it 
combines precise experimental control with 
natural behaviours. It allows experiments that 
are not possible using real-world approaches. 
As such, it has increased our understanding 
of neural processes in subjects ranging from 
humans to insects. 

What are the experimental benefits of VR? 
First, the technology allows researchers to 
define explicitly and exhaustively the sensory 
cues that carry information about the virtual 
world. In real-world experiments, it is not 
possible to control all sensory cues. For exam- 
ple, when studying the contribution of visual 
cues to navigation, confounding information 
could be provided by unmeasured smells, 
sounds, textures and vestibular stimuli (inter- 
nal information about balance and spatial 
orientation). VR offers the means to add or 
remove sensory cues to test the contribution 
of each one to a neural code, and to build up 
a ‘minimal’ set of stimuli needed to produce 
a given behaviour or neural activity pattern. 

A second benefit comes from the ability 
to redefine the laws that link the subject’s 
actions to changes in its world. When an 
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@ Typically, movement is restricted by fixing 
the rodent’s head in position; this allows 
precise measurements of neural activity to 
be taken and correlated with motor actions 
in animals that are awake, rather than 
anaesthetized (Fig. 1). 

@ Many researchers think that VR is a 
valuable tool for studying both navigation 
and sensory systems. 

@ However, a body of work’ indicates that 
the way in which mice navigate in real and 
virtual worlds Is different. 


animal explores the real world, it is difficult to 
disentangle which neural responses are attrib- 
utable to the animal’s actions and which are 
caused by sensory stimuli, because the two are 
rigidly linked by the laws of physics. In VR, this 
link can be modified in informative ways — 
sensory and motor features can be dissociated 
by changing the gain or lag between an action 
and a subsequent update of the virtual environ- 
ment, or be made independent of one another 
for brief periods. Sensory and motor variables 
can therefore be separated while allowing the 
subject to interact naturally and actively with 
the sensory world. 

Third, VR increases the range of tools 
available to measure neural activity. Because 
the subject is usually constrained, techniques 
can be applied that are either not possible or 
of poorer quality in freely moving subjects. 
These include functional magnetic reso- 
nance imaging, high-resolution fluorescence 
imaging and intracellular single-neuron 
electrophysiology. 

Many studies have shown that animals can 
solve navigational tasks in virtual worlds*®. But 
the aspects of navigation that can be studied 
in VR depend on the experimental set-up — 
for instance, the number of sensory cues 
simulated, the degree of sensory immersion 
and how naturally the subject interacts with 
the virtual world. In VR experiments that pro- 
vide visual inputs and allow body rotations to 
trigger vestibular signals, neural activity pat- 
terns during navigation are consistent with 
those measured in real-world experiments”. 
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Furthermore, studies’* that remove key 
sensory inputs such as vestibular stimuli reveal 
which aspects of navigational neural activity 
depend on vestibular input and which can be 
supported by visual cues alone. Therefore, VR 
can recapitulate neural activity in real environ- 
ments, and VR experiments can be designed to 
create informative differences between neural 
function in real and virtual worlds. 

Overall, VR has yielded many insights into 
sensorimotor integration, decision-making 
and navigation®. But it is important to remem- 
ber that, like all reductionist approaches, 
VR requires a trade-off between improved 
experimental accessibility and consistency 
with natural processes — the optimum 
set-up depends on the research question 
being asked. For instance, in studies of sen- 
sorimotor integration, it is crucial to dis- 
sociate sensory and motor variables. In 

navigation studies, 


VRexperiments iors are needed t 
rag a ed probe the subject’s 
: ti internal model of 
iiferonces the physical world. 

VR must be used 
b etween neural judiciously, so that 
function in real its implementation 
and ae matches the needs 
worlds. 


of the question. 
Of course, this 
requirement applies to all experimental tools 
and is not specific to VR. 

In summary, we consider VR as bridging 
the gap between natural behaviour and con- 
ventional reductionist approaches; this is a 
major step forward in the study of complex 
behaviours in many species. As the com- 
munity of VR users grows and commer- 
cial VR technologies expand, we expect the 
range of applications for VR to continue 
to grow, enhancing our understanding of 
neural function. 
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A world away 
from reality 


FLAVIO DONATO & EDVARD I. MOSER 


ere) that involves VR has obvious 
advantages for studies of simple sensori- 
motor computations, in which a defined set 
of inputs, such as those corresponding to an 
animal’s movement, is associated linearly 
with neural output. However, some pressing 
concerns are raised when VR technology is 
used to study higher-order computations 
such as spatial navigation. Navigation reflects 
the integration of many sensory inputs. The 
resulting outputs are not linearly related to sen- 
sory perception, but rather express cognitive 
abstractions. 

Goal-driven navigation relies on several cell 
types in the brain, including place cells (which 
fire when an animal is ina particular location), 
grid cells (which fire at periodically spaced 
positions across the entire environment) and 
border cells (which fire selectively along local 
borders)’*. By fixing an animal’s head in place, 
investigators can monitor the activity of these 
neurons at high resolution while the animal 
runs between specific locations in virtual 
space. But do animals navigate in the same way 
in VRas in real life? 

Navigating in the real world is a multi- 
sensory process that integrates visual, olfactory 
and tactile stimuli with vestibular information 
and information about the activity of moving 
body parts. But in VR, these elements are often 
not coordinated, and the animal’s sensory 
experience is largely reduced to a combina- 
tion of visual inputs and locomotion, which 
are easy to control. The animal must overcome 
discrepancies between visual cues that follow 
movements and cues that are static in VR, such 
as smell or head direction. Conflicts between 
movement and sensory inputs might alter the 
activity of space-encoding neurons to reflect 
only information coordinated to motion, such 
as visually changing landmarks and accumu- 
lated distance’”, at the expense of other cues. 
This could lead researchers to overestimate the 
contribution of visual inputs to navigation and, 
in the most extreme cases, might lead to the 
loss of computation altogether’. 

A particular concern is whether the loss 
of vestibular input that accompanies move- 
ment restriction affects animals’ computa- 
tion of their position. A continuous mismatch 
between vestibular and visual inputs might not 
be detrimental in linear environments. When 
an animal runs in a straight line, visual inputs 
are repeatedly and stereotypically paired to 
the same locomotor information, which may, 
with continued training, allow the animal to 
compensate for mismatches. However, such 
a mismatch might have a greater effect in 
two-dimensional or 3D VR arenas. 
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Figure 1 | A mouse explores a virtual world. In most typical virtual-reality experiments, a mouse 

is head-fixed above a ball. Its legs are free, allowing it to move the ball in all directions. By moving 

the ball, the mouse navigates around a virtual world that is projected onto a 270° doughnut-shaped screen 
in front of it. Head fixing enables neural activity to be measured and correlated with the motor actions 


that drive movement. 


Indeed, if movement is unrestrained, the 
position-coding activity of place and grid cells 
is similar in 2D VR to that in the real world’. 
In stark contrast to this, position coding is 
disrupted and a new coding emerges when 

body movement 
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tions of head or 
body restriction. Strategies that compensate 
for the loss of synchrony between vestibular 
information and the animal's behaviour would 
be a welcome advance. 

Finally, are all types of position-coding 
cell represented in VR-based navigation? It is 
unclear if and how border, speed and head- 
direction cells are activated when movement 
is restricted. Moreover, cells might not fire 
in the same way in the two worlds. In one 
analysis’, 60% of the place cells activated 
in the real world were silent in VR. Whereas 
studies typically check that VR-activated 
cells are represented in real-world sessions, 
the opposite direction of investigation lags 
behind — although there are exceptions 
to this’. 

More than 40 years ago, the neuroscientist 
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John O’Keefe changed our understanding of 
the physiology of navigation by studying rats 
freely foraging for food. By allowing the natu- 
ral sensory-motor interactions required for 
the formation of an internal representation of 
space, O’Keefe discovered the first element of 
the ‘cognitive map’ — the place cell’. VR can 
extend that ecological approach to higher cog- 
nitive functions. But to do so successfully, the 
technology needs further development and 
validation. m= 


Flavio Donato and Edvard I. Moser are 

at the Kavli Institute for Systems 
Neuroscience, Norwegian University 

of Science and Technology, 7491 Trondheim, 
Norway. 

e-mails: flavio.donato@ntnu.no; 
edvard.moser@ntnu.no 


1. Ravassard, P. et al. Science 340, 1342-1346 
(2013). 

2. Aghajan, Z. M. et al. Nature Neurosci. 18, 121-128 
(2015). 

3. Acharya, L. etal. Cell 164, 197-207 (2016). 

4. Harvey, C. D., Coen, P. & Tank, D. W. Nature 484, 
62-68 (2012). 

5. Aronov, D. & Tank, D. W. Neuron 84, 442-456 
(2014). 

6. Dombeck, D. A. & Reiser, M. B. Curr. Opin. Neurobiol. 
22, 3-10 (2012). 

7. Moser, E. |., Kropff, E. & Moser, M.-B. Annu. Rev. 
Neurosci. 31, 69-89 (2008). 

8. Rowland, D.C., Roudi, Y., Moser, M.-B. & Moser, E. |. 
Annu. Rev. Neurosci. http://dx.doi.org/10.1146/ 
annurev-neuro-070815-013824 (2016). 

9. O'Keefe, J. & Dostrovsky, J. Brain Res. 34, 171-175 
(1971). 


This article was published online on 11 May 2016. 


19 MAY 2016 | VOL 533 | NATURE | 325 


eeu NEWS & VIEWS 


DRUG DISCOVERY 


Fighting evolution with 
chemical synthesis 


A synthetic strategy has been developed that provides easy access to structurally 
diverse analogues of naturally occurring antibiotics, providing a fresh means of 
attack in the war against drug-resistant bacteria. SEE ARTICLE P.338 


MING YAN & PHIL S. BARAN 


strong resistance to various antibiotics 
as a result of the rapid evolution of the 
bacteria in response to widespread antibiotic 
misuse. This problem is exacerbated by the 
reduction in antibacterial-drug research by 
large pharmaceutical companies over the past 
few decades — we are failing to replenish our 
armamentarium quickly enough to compen- 
sate. An apocalyptic scenario looms wherein 
a drug-resistant ‘superbug’ becomes so well 
established that the most trivial infections 
would be as life-threatening as they were dur- 
ing the pre-antibiotic era. On page 338 of this 
issue, Seiple et al.’ offer an ingenious strategy 
to bolster the antibiotic-discovery process: 
a highly modular and versatile chemical- 
synthesis approach that provides access to a 
rich array of antibiotics possessing different 
molecular topologies and functionalities. 
Naturally occurring compounds derived 
from bacteria or fungi have long been the pri- 
mary source of antibiotics. These compounds 
can be produced on a large scale by fermen- 
tation, but few inherently possess adequate 
bioavailability, in vivo activity or metabolic sta- 
bility for use as drugs’. Chemical modifications 


Pisrensses bacteria have developed 


are often necessary to address these issues. 
The complexity of natural products means 
that such modifications often involve a long, 
linear sequence of steps in which chemical 
groups in the molecules are masked with ‘pro- 
tecting’ groups to allow the modifications to 
be made, and then unmasked. Making vari- 
ations at certain molecular locations tends to 
be particularly laborious or impractical, and 
so synthesizing a diverse collection of anti- 
biotic candidates for drug discovery can be 
forbiddingly difficult. 

Seiple and colleagues show convincingly 
that chemical synthesis can surmount the chal- 
lenge of molecular diversification, trumping 
strategies based on synthetic biology’ or semi- 
synthesis (in which compounds from natural 
sources are used as starting materials). In 
their approach, the authors constructed com- 
plex macrolides — antibiotics that contain a 
characteristic large ring of atoms known as a 
macrocycle — through the assembly of eight 
modular building blocks. These simple mod- 
ules could each be varied easily to incorporate 
different structural motifs at specific locations 
in the macrolide skeleton. 

The researchers’ synthetic route included 
seven key coupling reactions, each of which 
introduced different building blocks into the 


Solithromycin 


Figure 1 | A modular approach to macrolide synthesis. Seiple et al.' report a synthetic route that allows 
the rapid preparation of a diverse array of structural analogues of macrolide antibiotics. The authors 
prepared eight modular building blocks (coloured shapes), the structures of which can be easily modified. 
The synthetic route includes seven key steps in which the building blocks are joined together. Chemical 
groups in the modules can also be readily converted into various other groups after the modules have 
been incorporated into macrolide intermediates (not shown). This approach allowed the authors to 
prepare more than 300 macrolides, including the clinical candidate solithromycin. 
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macrolide (Fig. 1). Such reactions are said to be 
convergent because they bring together mod- 
ules that were prepared in separate, parallel 
reaction sequences. The convergent coupling 
reactions serve as branch points in the syn- 
thesis that allow a diverse range of structural 
modifications to be easily made. For example, 
Seiple et al. introduced reactive groups such 
as azides, allyl groups and amides at different 
positions, all of which could be readily con- 
verted into various structural motifs. Such 
manipulation, when combined with variations 
in the building blocks, enables an exponential 
expansion of diversity. 

This approach is possible only because the 
synthetic sequence is extremely well choreo- 
graphed. Steps such as protecting-group 
manoeuvres are kept to a minimum; conver- 
gent couplings and other complexity-building 
transformations account for the bulk of the 
sequence. These key reactions are mechani- 
stically very different from one another 
(ranging between the addition of an enolate 
and a transition-metal-catalysed cycloaddi- 
tion, for those in the know). Yet the authors 
show that these all work well for modules that 
carry a variety of groups. 

The formation of macrocyclic rings is one 
of the toughest challenges in macrolide syn- 
thesis — many routes to these antibiotics are 
foiled because attempts to form the ring fail’. 
Conventionally, macrocycle formation must be 
individually optimized for each reaction sub- 
strate by finding a set of protecting groups that 
allows the transformation to occur. However, 
Seiple and colleagues report that, in their work, 
macrolides containing various ring sizes can 
be made using almost identical ring-formation 
conditions. 

Similarly, other reactions in the synthetic 
sequence need minimal adaptation to work 
for different substrates that contain a variety of 
chemical groups, and deliver consistent yields 
of products. Such generality is the cornerstone 
of the authors’ approach, because it allows 
hundreds of analogues of macrolides, bearing 
highly dissimilar arrangements of attached 
groups, to be accessed expeditiously with little 
perturbation to the overall strategy. 

Another prerequisite for rapidly generat- 
ing structural diversity is an abundant sup- 
ply of diversifiable intermediates; this in turn 
requires all the chemical transformations in the 
syntheses to be conducted on gram scales”. Sei- 
ple et al. tackled this challenge using a series of 
in-house methodologies. The first was a scal- 
able aldol reaction, a type of carbon-carbon 
bond-forming reaction. This reaction shows 
exceptional stereoselectivity (it preferentially 
forms isomers that have specific 3D orienta- 
tions of atoms) and tolerates the presence of a 
potentially interfering amino group (NH,) in 
one of the reactants’. The second was a con- 
cise approach’ to preparing the carbohydrate 
desosamine and its analogues, which form 
a key part of another building block. These 


examples highlight the synergy between 
target-oriented synthesis and methodology 
developments. 

Using such scalable methods allowed the 
authors to make gram quantities of sev- 
eral macrolides — a large amount for drug- 
discovery purposes — in remarkably high 
overall yields. This reaffirms the fact that scal- 
able synthetic methods that can provide ample 
supplies of structural analogues of molecules 
are essential for the continued success of drug 
discovery’. Given the sheer number of pos- 
sible macrolide analogues, some structural 
variations will remain challenging to produce 
synthetically, but further advances in synthetic 
methods will probably bridge the gaps. 

Impressively, Seiple and colleagues prepared 
more than 300 macrolide antibiotics as candi- 
dates for pharmaceutical development. The 
researchers report that most of these com- 
pounds inhibit the growth of drug-resistant 


NEUTRINO PHYSICS 


pathogens, including some extremely 
challenging strains. Further studies are neces- 
sary for assessing properties (such as toxicity 
and bioavailability) that will affect the antibi- 
otics’ clinical viability, but these compounds 
represent a gateway to commercial pharmaceu- 
ticals. One of the compounds is more effective 
against Gram-negative bacteria than are many 
other macrolides. Its activity is still modest, 
but marks a promising start to tackling an 
important medical need. 

No antibiotic is impervious to the power 
of evolution — bacteria will always mutate to 
develop resistance, which means that new anti- 
biotics must be pursued continuously and per- 
petually. To this end, synthetic platforms such 
as the present one offer rapid access to unparal- 
leled structural variability for antibiotic candi- 
dates, while still allowing exquisite control of 
each variation. This fine-tunability, along with 
biological knowledge, can enable chemists to 


A deliberate mix-up 


in flavour 


Neutrinos come in three ‘flavours’, as do antineutrinos, and they all change 
flavour as they travel. New measurements of the mixing of different neutrinos 
may help to explain why our Universe is made of matter and not antimatter. 


HELEN O'KEEFFE 


or decades, physicists have been 
f intrigued as to why the visible Universe 

is made of matter, rather than antimatter. 
Intuitively, studying the smallest of the known 
fundamental particles, the neutrino, may 
reveal the answer to this question. As neutrinos 
travel through space and time, they can oscil- 
late between three different types (flavours). If 
the way in which antineutrinos change flavour 
differs from the way neutrinos change flavour, 
these relatively small differences may have led 
to a matter-based Universe. 

Experiments have been set up to study the 
amplitude and frequency of these changes to 
determine the measurable parameters associ- 
ated with flavour oscillations, such as mass dif- 
ferences (neutrinos have three possible masses, 
denoted 1, 2 and 3) and mixing angles (how 
much mixing occurs between neutrinos of 
mass 1, 2 and 3). Writing in Physical Review 
D, the Daya Bay Collaboration’ reports that 
it has used a new detection method to make 
the most precise measurement yet of 0,,, the 
mixing angle between neutrino mass 1 and 
neutrino mass 3. This will help in our quest 
to understand the origin of the dominance of 
matter in the Universe. 


The Daya Bay experiment is located on 
the southern coast of China, approximately 
55 kilometres northeast of Hong Kong. The 
site consists of eight identical antineutrino 
detectors, located between 360 m and 1.9 km 
from the Daya Bay and Ling Ao nuclear power 
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keep up with bacterial evolution. In the midst 
of the antibiotic-resistance crisis, Seiple and 
colleagues’ work sends out a clear message: 
chemical synthesis is the way forward. = 
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plants. Approximately 3,500 billion billion 
electron antineutrinos (one flavour of anti- 
neutrino) are emitted from the nuclear reac- 
tors every second, and a small number of these 
interact with the 20 tonnes of gadolinium- 
doped liquid scintillator in each detector; the 
scintillator emits light when it absorbs an ener- 
getic particle. The electron antineutrinos inter- 
act with protons in the nuclei of the scintillator 
molecules to produce positrons (antielectrons) 
and neutrons in a process known as inverse 
B-decay. Within a few nanoseconds, the posi- 
trons produce light by scintillation, and this is 
detected by an array of photomultiplier tubes, 
which amplify the incident light. 

When a neutron is captured by a gadolinium 
nucleus embedded in the scintillator, it forms 
an excited state, which subsequently decays 
to a more-stable state by emitting a series of 
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Figure 1 | Neutrino measurements. The mixing angle 6,, is a measure of how much neutrinos that 

have masses 1 and 3 mix. The graph compares measurements of 6,, made by the Daya Bay', Double 
Chooz’ and T2K (ref. 4) experiments; values are plotted as sin’(26,,). Two values are shown for the T2K 
experiment because the best-fit value of @,, depends on the ordering of the neutrino masses: m,>m,>m, 
is known as the normal hierarchy (NH), whereas m,>m, >m, is the inverted hierarchy (IH). The Daya 
Bay measurement is the most precise so far. A non-zero 6, would open the way to experiments that might 
explain why the Universe is dominated by matter rather than antimatter. 
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y-rays with a total energy of 8 megaelectron- 
volts. These y-rays scatter and interact with 
the scintillator to produce another light pulse, 
typically tens to hundreds of microseconds 
after the first pulse that was generated by the 
positrons. Searching for two coincident light 
pulses that have specific properties can con- 
firm the detection of electron antineutrinos. 

Other reactor-based experiments” use the 
same gadolinium capture process. But the Daya 
Bay Collaboration now reports an additional, 
independent method to identify neutrons pro- 
duced through inverse -decay. In the detec- 
tors, neutrons are preferentially captured on 
gadolinium, but capture on a hydrogen atom 
of the organic liquid scintillator is also possible. 
Neutron capture on hydrogen produces deu- 
terium and releases a low-energy (2.2-MeV) 
y-ray, but this is hard to detect because many 
similar y-rays with energies below 4 MeV are 
emitted from decays of naturally occurring 
radioisotopes, such as potassium-40 (1.4 MeV) 
and thallium-208 (2.6 MeV). Such y-rays are 
problematic because they can mimic those 
released by neutron capture on hydrogen, 
obscuring the antineutrino signals. Radioac- 
tivity in the detector must therefore be mini- 
mized and controlled by careful purification of 
the scintillator, selection of low-radioactivity 
materials and stringent cleanliness. Despite 
such precautions, radioactivity is never com- 
pletely eliminated, and this ultimately limits the 
precision of the experiment. 

Detecting inverse B-decay from anti- 
neutrino interactions requires the identifica- 
tion of two light signals occurring within 1 to 
1,400 microseconds of each other. This timing 
requirement reduces the impact of coinciden- 
tal light flashes from background radiation 
that are not time correlated. The Daya Bay 
Collaboration was able to estimate the rate of 
accidental coincidences by using data recorded 
in its detectors. This quantity, subtracted from 
the total number of observed coincidences, 
yields the number of time-correlated coinci- 
dences. 

Further corrections were required to 
account for any time-correlated backgrounds, 
the most significant of which arises from the 
activation by cosmic rays of carbon-12 in the 
scintillator to produce lithium-9 and helium-8. 
Both of these isotopes decay by B-emission to 
neutron-unstable excited states, producing a 
signal of an electron followed by neutron cap- 
ture that is indistinguishable from the signal 
from inverse B-decay. 

When all the backgrounds were appropriately 
accounted for, the collaboration used the num- 
ber of genuine time-correlated coincidences to 
measure the flux of electron antineutrinos. The 
measured number of electron antineutrinos 
was found to be less than the collaboration had 
predicted [OK?]from reactor flux models. The 
missing electron antineutrinos demonstrate 
that they oscillate into other flavours that 
cannot be detected in this experiment. 
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This analysis from the Daya Bay experiment 
provides an independent measurement of the 
number of electron-antineutrino interactions 
in the detectors and represents an important 
cross-check of previous measurements of the 
mixing angle 6,;. Moreover, the combination 
of the collaboration’s latest hydrogen-capture 
results with those from its gadolinium analysis 
provides the most precise measurement of 0,, 
so far, where sin’(20,;) =0.082 + 0.004, an 8% 
improvement in precision. 

This measurement compares favourably 
with those from other reactor studies”’ and 
long-baseline experiments” ’, which provide 
long distances over which to observe changes 
in neutrino flavour. Closer examination of the 
results (Fig. 1) suggests slight discrepancies 
between current measurements of 6,;. Results 
from Double Chooz’ and T2K (ref. 4) tend 
to favour higher values of 0,,, although not 
significantly so. 

Measurements of 6,,; are particularly 
valuable in our quest to understand the matter- 
dominated Universe through measurement 
of another neutrino oscillation parameter, 
dcp. Non-zero 6-p would indicate that neutri- 
nos violate charge—parity symmetry (under 
this symmetry, the laws of physics should be 
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the same when a particle is replaced by its 
antiparticle and inverted through a mirror); 
this may explain why the Universe is dominated 
by matter. Combining results from reactor and 
long-baseline experiments provides a hint that 
dcp is indeed non-zero, but current experiments 
are not precise enough to reach the statistically 
significant ‘5 sigma’ discovery threshold. The 
next generation of experiments” will seek to 
make the first measurement of 6. and thus 
help us to understand the origin of our matter- 
dominated Universe. = 
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Limb regrowth 


takes two 


Salamanders can regenerate several of their organs, including amputated limbs. 
Analysis of a Mexican salamander shows that crosstalk between two signalling 
molecules regulates limb regeneration. SEE LETTER P.407 


MIGUEL TORRES 


he ability of salamanders to regenerate 
their limbs following amputation was 
first described in the eighteenth century 
by the Italian naturalist Lazzaro Spallanzani’. 
Researchers have been trying to identify the 
formula for regeneration ever since, in the 
hope of reawakening this lost ability in mam- 
mals. Experiments performed 40 years ago” 
showed that limb regeneration requires the 
participation of tissues from both sides of 
the limb — the side closer to the thumb (the 
anterior side) and the opposite, posterior side. 
However, the mechanisms involved in this 
process have remained a mystery. On page 407, 
Nacu et al.’ show that two diffusible signal- 
ling molecules underlie this requirement for 
anterior and posterior tissue. 
During limb regeneration, cells from the 
limb stump form a mass of progenitor cells 
called the blastema, which gives rise to the new 
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structure’. There are three requirements for this 
process. First, wounding must occur. This may 
seem obvious, but wounding triggers several 
tissue-repair responses that are essential for 
regeneration. The second requirement is proper 
innervation, because limbs from which nerves 
have been removed do not regenerate’. Finally, 
the blastema must contain abutting domains of 
cells derived from the anterior-most and poste- 
rior-most positions in the limb stump’. This last 
observation indicates that crosstalk between 
cells of differing origins triggers mechanisms 
that are essential for regeneration. 

If wounding on the anterior side of the limb 
is combined with redirection of nerves from 
the limb proper into the wounded site, and 
with transplantation of a patch of posterior 
limb tissue onto the wound, an extra (acces- 
sory) limb can form’. By contrast, wounding 
and nerve redirection without transplanta- 
tion result in a ‘frozen’ blastema — regen- 
eration does not occur owing to the lack of 


Anterior cells 
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a_ Anterior-only blastema 
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Figure 1 | Complementary signals promote limb regeneration. a, In 
salamanders, if a limb is wounded and nerves are redirected to the wound 
site, an extra limb can arise from a cell cluster called a blastema. However, the 
blastema generates a new limb only if it contains cells from both the anterior 
and posterior sides of the original limb. Nacu et al.’ report that blastemas that 
contain only anterior tissue cannot form limbs because they do not receive a 


posterior tissue. Using highly regenerative 
Mexican salamanders called axolotls, Nacu and 
colleagues took this ‘anterior-only’ experimen- 
tal set-up as a starting point to precisely test 
the molecular pathways involved in antero- 
posterior crosstalk. The authors found that 
treating frozen blastemas with a molecule 
called smoothened agonist (SAG) reawakened 
the tissue, leading to production of an accessory 
limb from exclusively anterior tissue. 

SAG mimics the activity of a protein called 
Sonic hedgehog (Shh), which is normally 
secreted by the posterior-most cells of the blas- 
tema. Shh diffuses across the tissue, binding 
to receptors on the surface of receiving cells to 
trigger intracellular signalling cascades. Nacu 
et al. found that stimulation of Shh signalling 
in anterior blastema cells by SAG activated 
another secreted diffusible protein, fibro- 
blast growth factor 8 (FGF8). Sustained FGF8 
activity was necessary for anterior-only limb 
production (Fig. 1a). Furthermore, activation 
of FGF8 could induce anterior limb growth 
even in the absence of Shh signalling. 

Like anterior-only blastemas, posterior-only 
blastemas cannot regenerate, despite contain- 
ing Shh-expressing cells. Investigating this 
seeming contradiction, the authors found that 
posterior tissue could not activate strong and 
sustained FGF$8 signalling in response to Shh 
(Fig. 1b). Forced FGF8 expression in the pos- 
terior tissue led to the production of accessory 
limbs — but, in this case, only when Shh signal- 
ling was also active, because inhibition of Shh 
blocked regeneration. Finally, Nacu et al. con- 
firmed that signalling by Shh and FGF8 was 
required for physiological limb regeneration 
after amputation (Fig. Ic). 

In summary, the authors have neatly 
dissected the antero-posterior crosstalk 
required for regeneration, and have defined 
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the factors that induce blastema development. 
Anterior cells cannot regenerate alone because 
they lack Shh signalling from posterior cells. 
And posterior cells cannot regenerate alone 
because they are unable to respond to Shh with 
sustained activation of FGF8. However, why 
forced FGF8 expression in the absence of Shh 
signalling cannot promote posterior-only limb 
growth remains unexplained. 

In future analyses, it will be important to 
determine whether this finding reflects the fact 
that anterior and posterior tissues have differ- 
ent requirements for regeneration. An alterna- 
tive possibility is that it reflects experimental 
design limitations — for example, nonspecific 
effects of the drug that the researchers used to 
inhibit Shh in the posterior cells. Similarly, the 
reason for anterior-specific FGF8 activation 
by Shh remains unexplained, and should be a 
focus of future research. 

The mechanisms at work during limb 
regeneration have long been debated, and mod- 
els fall into two broad classes. Those in the first 
class posit that every cell contains positional 
information, and that regeneration involves 
only local cellular interactions. By contrast, 
models in the second class invoke the action of 
diffusible signals produced by specific regions 
in the blastema called organizers, which direct 
development of the surrounding tissue. 

Nacu and colleagues’ work favours organizer 
models, but does not exclude some degree of 
positional encoding. Consistent with organizer 
models, the authors clearly demonstrate that 
anterior cells do not produce Shh, and 
that anterior-only accessory limbs do not 
acquire posterior characteristics (for exam- 
ple, they lack digits that normally arise 
on the posterior side of the limb). But the 
authors’ data also indicate that blastema cells 
tend to form sharply defined anterior and 
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signalling protein called Shh, which is secreted by the posterior cells. Anterior 
cells normally produce the signalling protein FGF8 in response to Shh 
signalling, triggering limb formation. b, By contrast, posterior-only blastemas 
receive Shh, but are unable to activate FGF8 in response. c, Following limb 
amputation, both posterior and anterior cells contribute to the blastema, and 
regeneration is promoted by cross-induction between Shh and FGF8. 


posterior domains, suggesting that cells identify 
themselves according to their position within 
broad domains — as anterior or posterior, for 
instance, and, along a perpendicular axis, as 
dorsal or ventral. In several ways, these obser- 
vations fit the boundary model’, which states 
that organizer regions that produce diffus- 
ible signalling molecules are induced at the 
boundaries between broad limb domains. 
Shh and FGF8 have well-defined roles 
in embryonic limb development in various 
vertebrate species. A positive feedback loop 
between the two signalling pathways has been 
described, and involves relay signalling by 
Gremlin, a third secreted diffusible protein®”. 
Nacu et al. identified robust Gremlin expres- 
sion associated with the interaction between 
anterior and posterior tissue, suggesting that 
this feedback loop probably also operates 
during limb regeneration in salamanders. 
Given the evolutionary conservation of 
the Shh and FGF pathways, the current study 
brings us a step closer to the possibility of using 
defined factors to promote tissue regeneration 
in mammals — provided that a blastema can be 
induced. This advance, together with the same 
groups recent identification” of a diffusible 
molecule called MLP that is responsible for the 
cell proliferation leading to blastema formation, 
represents substantial progress in the quest for a 
general formula for organ regeneration. = 
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Illuminating 
brown dwarfs 


Objects known as brown dwarfs are midway between stars and planets in mass. 
Observations of a hot brown dwarf irradiated by a nearby star will help to fill a gap 
in our knowledge of the atmospheres of fluid planetary objects. SEE LETTER P.366 


ADAM P. SHOWMAN 


he illumination received from a nearby 

star has a crucial role in shaping an 

atmosphere’s three-dimensional tem- 
perature structure, chemistry, climate and 
weather. Less obviously, the irradiation of one 
star by another nearby star — as commonly 
occurs in tightly orbiting binary star systems 
— can lead to observable’ temperature differ- 
ences between the illuminated star's ‘dayside’ 
and its ‘nightside. However, only a few obser- 
vations have documented the effect of stellar 
irradiation on the atmospheres of a class of 
object that is intermediate in mass between 
stars and planets: brown dwarfs. On page 366 
of this issue, Hernandez Santisteban et al.” pre- 
sent intriguing observations to characterize the 
atmosphere and estimate the day-night tem- 
perature difference for a brown dwarf irradi- 
ated by a nearby star. 

When Sun-like stars reach old age, their 
outer layers expand, allowing them to engulf 
nearby, closely orbiting companions. Small 
stars or brown dwarfs often survive this ordeal, 
but friction caused by the orbital movement 
of the companion through the tenuous outer 
layers of the star provides a drag on the com- 
panion, causing it to spiral slowly inward. The 
outer layers of the bloated star eventually puff 
off into space, leaving behind a stellar remnant 
called a white dwarf, which is typically Earth- 
sized but halfas massive as the Sun. By the end 
of this process, the companion’ orbit has often 
shrunk to the point that the two objects nearly 
touch. 

The system characterized by Hernandez 
Santisteban et al., dubbed J1433, is just such 
a system, consisting of a white and a brown 
dwarf. The objects are so close that they orbit 
each other every 78 minutes. The gravity from 
the white dwarf distorts the shape of the brown 
dwarf and leads to a trickle of mass from the 
companion, which slowly accretes onto the 
white dwarf. 

The closeness of the white and brown dwarfs 
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means that they cannot be resolved individu- 
ally in images. However, the white dwarf has a 
temperature that exceeds 13,000 kelvin (more 
than double the temperature at the surface of 
the Sun), which causes most of its radiation 
to escape at short, ultraviolet wavelengths. By 
contrast, the brown dwarf’s surface tempera- 
ture is about 2,400 K, and most of its radiation 
escapes in the near-infrared region of the spec- 
trum. Although the white dwarf emits a greater 
total energy flux, the cooler but larger brown 
dwarf dominates the system’s flux in the near 
infrared. Radiation in this wavelength range 
thus allows the brown dwarf’s atmosphere 
to be characterized. Hernandez Santisteban 
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et al. obtained high-resolution spectra that 
extended from the ultraviolet to the infrared, 
allowing the authors to tease apart light from 
the two objects. 

To determine how irradiation affects the 
brown dwarf, the authors tracked how infrared 
light from the brown dwarf changes through- 
out its orbit. The system's orbital plane lies 
nearly in the line of sight to Earth, implying that 
the brown dwarf’s day and night hemispheres 
rotate in and out of view throughout the orbit. 
The researchers’ observations show that the 
average dayside temperature is about 57 K 
warmer than the average nightside tempera- 
ture. The hottest dayside region is about 200K 
warmer than the coolest nightside region. 

These observations are important, given the 
substantial effort over the past two decades 
to understand the atmospheres of irradiated 
exoplanets called hot Jupiters — Jupiter-mass 
planets that orbit very close to their stars and 
that are blasted by starlight. Hot Jupiters com- 
monly have daysides many hundreds of kel- 
vins hotter than their nightsides*’. But they are 
typically about 1,000-10,000 times dimmer 
than their host stars in the infrared, making 
their observation extremely difficult. The fact 
that brown dwarfs in systems such as J1433 are 
brighter in the infrared than their white-dwarf 
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Figure 1 | A wide parameter space for fluid planetary bodies. Fluid planets and brown dwarfs 

(which are intermediate between planets and stars) are subject to a broad range of internal heat fluxes 

and external irradiation from nearby stars. Giant planets in our Solar System, such as Neptune and Jupiter, 
have low internal and external heat fluxes®. Warm and hot Jupiters — Jupiter-sized exoplanets — have 
much higher external irradiation’, whereas Y-, T- and L-type brown dwarfs have much higher internal 
fluxes'’. Hernandez Santisteban et al.’ report observations of J1433, a system in which a brown dwarf 
occupies a tight orbit around a white dwarf star. This brown dwarf has high internal flux and experiences 
high external irradiation. Observations of other J1433-like brown dwarfs and of young hot Jupiters will 
therefore provide information that will build a complete picture of atmospheric dynamics in fluid planets 
and brown dwarfs. Ellipses indicate approximate ranges; the broken grey line indicates equal internal and 


external fluxes. 
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primaries suggests that the atmospheres of 
these irradiated objects can be more easily 
characterized than can those of hot Jupiters, 
which might allow insight into the workings 
of the harder-to-observe planets. Several other 
white dwarf-brown dwarf binaries are known 
to exist, and may yield constraints on the cli- 
mate of irradiated fluid objects that are at least 
as good as those from J1433 (refs 5, 6). 

J1433-like systems also allow comparisons 
with other brown dwarfs. Most known brown 
dwarfs are isolated and receive no irradiation, 
so they gradually lose heat from their interiors 
and cool off over billions of years. This heat is 
transported through their interiors by convec- 
tion, which drives an active atmospheric circu- 
lation that manifests as patchy, time-variable 
clouds that cause significant changes in infra- 
red flux over time’. Much work is being done 
to understand this variability and the processes 
that control the surface patchiness. The extent 
to which these dynamical processes will be 
modified by external irradiation is unknown; 
future observations of J1433 and other irradi- 
ated brown dwarfs”® will help to answer this 
question. 

The giant planets in our Solar System (such 
as Jupiter, Saturn and Neptune) experience 
internal and external heat fluxes that are weak 
and comparable to each other*. By contrast, 
hot Jupiters receive external fluxes about a 
thousand to a million times greater than their 
expected internal fluxes’, and thereby show 
us how atmospheric circulation responds 
when external forcing dominates. Isolated 
brown dwarfs represent the opposite extreme, 
transporting enormous internal fluxes but typ- 
ically receiving negligible external irradiation. 
These types of body therefore constrain three 
corners of a broad parameter space of exter- 
nal irradiation and internal heat flux that spans 
many orders of magnitude in both parameters 
(Fig. 1). Until a few years ago, we lacked obser- 
vational constraints on the atmospheric behav- 
iour of substellar objects at the fourth corner 
of that parameter space — those subject to 
enormous external irradiation and inter- 
nal heat flux that are comparable to within a 
factor of ten. 

J1433 and related brown dwarf-—white dwarf 
binaries fill that gap, and could prove crucial 
in the quest to understand how atmospheric 
circulation depends on internal and external 
forcing. The small day-night temperature dif- 
ference inferred by Hernandez Santisteban 
et al. relative to that of many hot Jupiters** 
almost certainly results from the intense heat 
supplied to the atmosphere from the brown 
dwarf’s interior, but the interaction of the 
internal and external forcings could have 
myriad other consequences that remain poorly 
understood. 

The J1433 system is interesting in other 
ways. Hernandez Santisteban et al. argue 
that the brown dwarf began life as a star, but 
became a brown dwarf after losing mass to the 


white dwarf — a history that might affect its 
internal structure and atmospheric circulation. 
Moreover, because of the fortuitous orbital 
alignment of J1433 with the line of sight to 
Earth, the brown and white dwarfs eclipse each 
other once per orbit, providing an opportunity 
to characterize the atmospheric composition 
and thermal structure in the way that is com- 
monly done for hot-Jupiter systems. = 
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Snapshots of 
transcription initiation 


The enzyme RNA polymerase II, along with several transcription factors, initiates 
DNA transcription. Analyses reveal the structures involved in this process in 
human and yeast cells at high-resolution. SEE ARTICLES P.353 & P.359 


STEVEN HAHN & STEPHEN BURATOWSKI 


he initiation of DNA transcription 

involves a fascinating interplay between 

RNA-synthesizing RNA polymerase 
(Pol) enzymes, transcription factors and DNA. 
The Pol II complex is of particular interest 
because it synthesizes all messenger RNA in 
eukaryotic (nucleus-bearing) cells. The size 
and flexibility of Pol II complexes present huge 
challenges for structural biologists, but two 
studies in this issue, by He et al." (page 359) and 
Plaschka et al.” (page 353), exploit advances in 
cryo-electron microscopy to produce near- 
atomic-resolution snapshots of the Pol II 
machinery. 

The bacterial Pol machinery is a streamlined 
system that contains only four Pol subunits and 
a single transcription factor, sigma’. Because 
Pol active sites are highly evolutionarily con- 
served’, bacterial Pol has been used to estab- 
lish a general model of Pol action. This model 
suggests that Pols and their transcription fac- 
tors first associate with the promoter region of 
double-stranded DNA, which lies immediately 
upstream of the sequences to be transcribed, 
to form a structure called the closed complex. 

Next, around 10-13 base pairs of the 
promoter unwind, positioning the DNA 
strand to be transcribed at the Pol active site 
in an open complex (open and closed refer 
to the state of the DNA). Pol subunits form 
channels for incoming nucleotides and the 
exiting mRNA, and create a deep cleft for 
the template strand. A mobile clamp domain 
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traps DNA in the active site during the 
transition from the closed to the open complex. 
Finally, the structure contorts into an initial 
transcribing complex, maintaining contacts 
with promoter DNA while downstream DNA 
is pulled into the active site as RNA starts to 
be synthesized. 

In comparison with bacteria, the archaeal 
and eukaryotic transcription machineries are 
complex, with 12-17 Pol subunits and up to 
6 transcription factors. In the Pol II system, 
the transcription factors TBP, TFIIA, TFIIB, 
TFUE, TFUF and TFIIH are all required 
and, between them, perform the same func- 
tions as bacterial sigma”®. Years of bio- 
chemical, molecular and structural studies 
have probed the roles of each transcription 
factor to piece together a model of eukaryotic 
transcription initiation’. 

Many features of this model are brought to 
life in the current work. Both groups assem- 
bled purified transcription factors and Pol II 
on nucleic-acid scaffolds — on double-stranded 
promoter DNA for the closed complex, double- 
stranded DNA containing an unwound ‘bubble’ 
for the open complex, and a bubble with a short 
annealed RNA to resemble the initial transcrib- 
ing complex. He et al. used a complete set of 
human factors and all three scaffolds, whereas 
Plaschka et al. used all the yeast (Saccharo- 
myces cerevisiae) factors except TFIIH to 
visualize the closed and open complexes. 
Despite these differences, the positions of tran- 
scription factors and the trajectory of the nucleic 
acids show excellent agreement between the 
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B-reader 
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Figure 1 | Preparation for transcription. He et al.' and Plaschka et al.” solved structures of the enzyme 
RNA polymerase II (Pol II) in complex with transcription factors (including TFIIB, TFIIE and TFIIF) 

and DNA for humans and yeast. They solved closed and open complexes, which form in the lead-up to 
transcription initiation. a, The human closed complex reveals the trajectory of double-stranded DNA 
across a cleft in Pol II. DNA is enclosed by winged-helix (WH) domains of TFIE and TFIF Arrows show 
the directions in which DNA is moved by a TFIIH translocase enzyme. A flexible ‘B-linker’ region of TFIIB 
(not visible) is disordered in this state. b, In the open complex, about 12 bases of DNA have been threaded 
into the active site of Pol II, leading to the formation of a single-stranded ‘bubble’ from which the blue 
strand will be transcribed. Single-stranded DNA is stabilized by interactions with B-linker and B-reader 
domains of TFIIB. For simplicity, several transcription factors have been removed from this schematic. 


studies. Moreover, the structures are consistent 
with a range of previous analyses* ”. 

TBP recognizes the TATA element — a 
common DNA sequence in eukaryotic Pol I 
promoters. A subcomplex of TATA DNA, 
along with TBP, TFIJA and the cyclin domains 
of TFIIB, interact with the Pol II wall, posi- 
tioning downstream promoter DNA over the 
cleft. A long, flexible region of TFIIB snakes 
through the Pol II exit channel into the active 
site, helping to position DNA correctly. Ina 
major advance, the studies reveal how both 
structured and flexible regions of the tran- 
scription factors TFIE and TFIIF are posi- 
tioned on either side of the cleft, interacting 
with each other, Pol II and a TBP-TFIIB- 
DNA subcomplex to promote and stabilize 
structural transitions during open-complex 
formation. 

How does the complex transition into an 
initiation-competent state? Evidence indi- 
cates'*"* that one subunit of TFIIH is a trans- 
locase enzyme that drives DNA opening by 
threading downstream DNA into the Pol II 
cleft. He and colleagues’ structures for the 
closed and initial transcribing complexes 
support this model, showing that the trans- 
locase maintains contact with DNA in both 
states, and that an extra 12 bases are threaded 
into the cleft by the time that the latter 
has formed. 

The two groups present slightly different 
models for the transition from the closed to the 
open complex. He et al. propose that, during 
the transition, closure of the clamp — driven 
by translocase-generated torsional strain — 
traps single-stranded DNA. Clamp-associated 
TFUE also shifts, and a flexible linker region 
in TFIIB is fixed to stabilize single-stranded 
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DNA (Fig. 1). By contrast, Plaschka et al. 
propose that, in yeast, structural changes 
mediated by TFITE-TFIIB contacts clear the 
cleft of flexible TFIIB segments, allowing entry 
of the template strand, followed by closure of 
the clamp. In this model, TFIE, and probably 
TFIIB, stabilize single-stranded DNA in the 
open complex. 

Discrepancies between the models could 
reflect species differences in transcription 
factors or promoter sequences, differences 
in the size of the DNA bubble, or the absence 
of TFIIH in the yeast system. Both models 
involve small changes in flexible segments 
of TFIIB, TFIIE and TFIIF, so, despite the 
remarkable resolution achieved, the struc- 
tures might still have insufficient resolution 
to identify subtle differences. Moreover, these 
static structures capture only single states of a 
dynamic, multi-step mechanism. Perhaps the 
torsional strain that drives these transitions 
affects the structural state of the complexes — 
an area for future study. 

The current research highlights similarities 
and differences in initiation mechanisms 
between all multi-subunit Pols. For instance, 
bacterial-DNA unwinding is initiated when 
a wedge structure in sigma flips out a base, 
leading to the opening of the DNA bubble’””®. 
TFIIB, which has a completely different 
amino-acid sequence from sigma, interacts 
with the same segment of promoter DNA. 
Both factors stabilize the open state and must 
be ejected from the active site before transcrip- 
tion progresses into elongation mode. Further- 
more, eukaryotic Pols I, I] and II all use TBP, a 
TFIIB-like factor and subunits related to TFIIE 
and TFIIF (ref. 17), but Pol IT is the only one 
to require a translocase. The reason for this 
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is unknown, but probably involves greater 
stability of the Pol I and III open complexes, 
analogous to the stable bacterial open complex. 

These fantastic new structures provide a 
framework for interpreting many past results, 
and are a harbinger of future investigations. 
For instance, a recent structure’® of a TBP- 
containing complex called TFIID, which helps 
to activate transcription, revealed that some 
TFIID subunits interact with promoter DNA. 
This large structure can be docked into the 
current studies’ closed-complex structures — 
suggesting unexpected interactions between 
the basal transcription factors and TFIID sub- 
units that may be important in transcriptional 
regulation. 

Eventually, it should be possible to directly 
visualize how multi-subunit coactivator com- 
plexes, such as TFIID, or another coactiva- 
tor, Mediator, interact with the transcription 
machinery. Mediator is missing from the 
authors’ structures, but is required for nearly 
all Pol II transcription’’. The carboxy-terminal 
domain of Pol II’s largest subunit, a key regula- 
tory target”, is also missing. Rapidly advancing 
technologies for structural biology hold great 
promise for many more interesting insights 
into the regulation of gene expression. m= 
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The use of large-scale genomic and drug response screening of cancer cell lines depends crucially on the reproducibility 
of results. Here we consider two previously published screens, plus a later critique of these studies. Using independent 
data, we show that consistency is achievable, and provide a systematic description of the best laboratory and analysis 


practices for future studies. 


therapeutics and determinants of drug sensitivity. With the 

rapidly growing recognition of intra-tumour and inter-patient 
heterogeneity, the optimal use of these models demands a comprehensive 
characterization of their molecular features and a sufficient number of 
models to capture the genomic diversity observed across human cancers. 
Two recent projects, the Genomics of Drug Sensitivity in Cancer (GDSC) 
and the Cancer Cell Line Encyclopedia (CCLE), collectively characterized 
more than 1,000 cell lines and nearly 150 anti-cancer agents’”, providing 
a potentially valuable resource for the research community. However, 
a subsequent meta-analysis published in this journal asserted that drug 
response results failed to agree®. As a consequence, Haibe-Kains et al.° 
expressed serious concerns about the validity and use of these resources, 
thus raising questions about screening projects of this type more gen- 
erally. To address these concerns and to determine the causes for the 
controversy, we independently characterized the response of 410 cancer 
cell lines to 16 agents tested by GDSC and CCLE. We also evaluated three 
specific aspects of the screening protocols that are relevant to measured 
drug response: readout of cell viability (metabolic versus DNA content), 
seeding density strategy, and cell culture media conditions. For screens 
of this type to be useful, they must accurately identify drug-sensitive and 
drug-resistant cell lines, as well as biomarkers of drug response that gener- 
alize to other contexts and data sets. Using more robust viability statistics 
and concordance metrics that are aligned with these two key screening 
objectives, we showed strong agreement between our own results and 
those generated by CCLE, thereby demonstrating that reliability is achiev- 
able. Concordance with GDSC was weaker, although still appreciable. Our 
data further demonstrated that, for several drugs, differences in media 
conditions and seeding density contributed to residual differences 
between studies. Unexpectedly, we found that metabolic and DNA con- 
tent readouts produced very similar estimates of drug sensitivity, but 
that the DNA content readout was markedly noisier. Taken together, our 
results revealed the technical and analytical parameters that are crucial 
for achieving replicable results. These findings should be of value in 
ensuring the robustness of data generated by similar efforts, and in max- 
imizing the utility of cancer cell line panels for the study of anti-cancer 
agents and biomarkers of drug sensitivity. 


| | uman cancer cell lines are vital models for studying anti-cancer 


Results 
The Genentech Cell Line Screening Initiative (gCSI) was undertaken 
independently of CCLE and GDSC. Nonetheless, gCSI had profiled 


410 cell lines and 16 drugs that were also evaluated by the other projects. 
Figure 1 and Extended Data Fig. 1 compare the mean of fitted viability 
values (equivalent to the familiar area under the curve, Methods) as well 
as 50% inhibitory concentration (ICs , Methods) statistics produced by 
gCSI with those from CCLE or GDSC. As expected, chemotherapeutics 
typically inhibited growth of nearly all cell lines to some extent, while 
targeted agents such as the selective kinase inhibitors erlotinib or lapatinib 
only affected a small number of cell lines that are presumably dependent 
on specific pathways. 


Identifying sensitive or resistant cell lines 

Figure 1 and Extended Data Fig. 1 provide a useful overview of the data 
as well as continuity with the presentation of Haibe-Kains et al. (figure 2 
in ref. 3). However, this representation of drug response has two crucial 
shortcomings that, if unaddressed, confound cross-study compari- 
sons. First, truncation to tested doses (for ICs) or to the 0-100% range 
(as done by GDSC for mean viability) obscures important aspects of the 
data distribution and yields large numbers of ties, which are not appro- 
priate for many standard statistical approaches. Second, although ICs» is 
a familiar and interpretable statistic and is often appropriate for smaller 
screens, it is not well-suited to high-throughput screening. For the large 
number of cell lines that never achieve 50% inhibition, it forces either 
truncation or extrapolation, both of which are potentially problematic. 
Furthermore, it can produce high-variance estimates (see Extended Data 
Fig. 2 for illustration). The mean viability statistic avoids both of these 
problems, and for this reason, we use it exclusively in all analyses that 
follow. 

Because high-confidence identification of groups of sensitive and 
resistant cell lines is a primary objective of profiling, we first compared 
gCSI and CCLE results on this basis. For targeted agents expected to 
inhibit only a few lines (Fig. 2a), we fit two-component mixture distri- 
butions to assign ‘sensitive; ‘no-call’ or ‘resistant’ labels (Extended Data 
Fig. 2, Methods). For broadly active compounds (Fig. 2b), it was more 
appropriate to divide lines into the three response categories in equal 
proportion. It is common to omit unclassified lines or those exhibiting 
intermediate response; in cases where both data sets confidently assigned 
a sensitive or resistant label, the strong majority of calls were concordant 
(84-96%, Table 1). Furthermore, cell lines with molecular features previ- 
ously associated with drug response (for example, ERBB2 overexpression 
for lapatinib) were almost entirely absent from regions where one or both 
data sets assigned a resistant label (Fig. 2). Thus, independently generated 
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gCSI mean viability (%) 
Figure 1 | gCSI, CCLE and GDSC drug response statistics reveal 
different patterns of response for specifically versus broadly active 
drugs. We include all lines in the two-way gCSI/CCLE overlap in the 
top rows, and all lines in the two-way gCSI/GDSC overlap in the bottom 
rows (see Extended Fig. Data 3 for restriction to only those lines included 
in the three-way overlap). In each panel, n refers to the number of lines 
in common and plotted for each pairwise comparison. Mean viability 


drug response data can reveal very similar sets of high-confidence 
sensitive and resistant lines. 

When gCSI and GDSC results were compared on a similar basis, 
concordance for PD0325901 and paclitaxel was good, although not as 
strong as that observed relative to CCLE (Table 1). This was due in part 
to the lower maximal doses used by the GDSC, and their truncation 
of mean viability at 100%. For two out of three targeted agents, on the 
other hand, gCSI was in slightly higher agreement with GDSC than with 
CCLE (Table 1). However, concordance here was driven by the predom- 
inance of resistant lines, since GDSC seems to have selected very few cell 
lines sensitive to erlotinib and lapatinib (Extended Data Fig. 3). For the 
10 drugs in common between only gCSI and GDSC, concordance 
between these groups was also generally lower than for the five drugs 
common to all three groups (Extended Data Fig. 4). 


Lapatinib 


b Broadly active 
PD0325901 


Paclitaxel 


4199 
Mean viability (%) 
4199 


osdd 
osd) 


25 50 75 100 0 25 50 75 100 

gCSI mean viability (%) 
(equivalent to area under the log-dose/response curve) across tested 
drug concentrations for cell lines as assayed by each pair of projects is 
plotted for specifically active drugs (a) and broadly active drugs (b). 0% 
corresponds to complete growth inhibition at all doses; 100% corresponds 
to viability equal, on average, to untreated control wells. Values are capped 
at a maximum value of 100% for consistency with GDSC published data. 
gCSI values are derived from three biological replicates (Methods). 


Genomic biomarkers for drug response 

We next turned to a second concordance metric aligned with a key screening 
objective: identification of predictive biomarkers of drug response. In 
their original publications, both CCLE and GDSC used a well-established 
penalized regression strategy, the ‘elastic net”, to identify predictive 
biomarkers!”. The mathematics behind this method guarantees that only 
a handful of the tens or hundreds of thousands of candidate molecular 
features are selected. For many applications, such parsimony is highly 
desirable. When contrasting two sets of results, however, it can make 
assessment of underlying similarity difficult, because some features that 
are in fact well-correlated with response may be omitted if others that pro- 
vide equivalent information are selected instead. A similar phenomenon 
has been described when constructing gene expression signatures to 
assign cancer subtype”. Despite this caveat, a direct comparison of elastic 
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Figure 2 | Highly concordant sensitive and resistant labels assigned via 
mean viability. a, For targeted agents, dotted lines indicate boundaries 
between sensitive, no-call and resistant lines (Methods). Expected 
responder cell lines (those with molecular features known to be associated 
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e Agreement: resistant 


e Disagreement Expected responder 


with drug response) are indicated in red for each drug (Methods). b, For 
broadly active compounds, lines were divided into three equally sized 
groups on the basis of their ordered mean viability: sensitive, no-call and 
resistant. 
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Table 1 | Classification concordance 


ANALYSIS Sitayaie 


GDSC 
Agree 
P value Called (%) Sensitive Resistant Disagree Agree (%) P value 
2.0x 10-° 85 2 68 10 89 58x 10° 
5.8x 10-16 66 2 48 4 ce 2.9x 10-3 
3.1% 10-7 59 2 54 98 1.9x 10-3 
10x 10-" - - - - 7 - 
15x 10- 60 ZS 1% 18 70 3.9x 10-3 
9.2 x 10-34 46 55 45 16 86 87 x10-19 


CCLE 
Agree 
Type Drug Called (%) Sensitive Resistant Disagree Agree (%) 
Crizotinib 60 o 146 17 90 
Specific Erlotinib 57 35 100 25 84 
Lapatinib 48 24 100 10 93 
lrinotecan 53 41 38 9 90 
Broad Paclitaxel 47 o7 55 16 88 
PDO325901 50 72 64 5 he) 


Lines called sensitive or resistant by both gCSI and CCLE (or separately, by gCSI and GDSC) were assessed for concordant classification. The percentage of such doubly called lines is given in the 
‘Called’ column. The counts of cell lines called sensitive or resistant by both groups, or with discordant classifications, are listed along with the percentage of doubly called cell lines with concordant 
classifications. P values correspond to a Fisher’s exact test for the sensitive/resistant two-way table. For targeted agents, total concordance was slightly higher for GDSC than CCLE, although 

very few sensitive lines were called among the gCSI/GDSC overlap. For broadly active agents, total concordance was significantly higher for CCLE. 


net results obtained from gCSI and CCLE data revealed substantial simi- 
larity. Specifically, for five of the six drugs common to the studies, known 
correlates of response were recovered from gCSI and CCLE response data 
when fit against either their own genomic data or that of the other study 
(Extended Data Fig. 5a). gCSI was also able to recover a known correlate 
of crizotinib response when fit against its own genomic data (Extended 
Data Fig. 5a). 

To perform a more sensitive comparison, we next applied the elastic net 
to one study's drug response data to select biomarkers, and then separately, 
applied univariate regression to score all candidate biomarkers relative 
to the other study's response data (features that scored strongly had pre- 
dictive value for response). Figure 3 shows that although a small number 
of features selected by one study appeared uninteresting in the other, the 
vast majority of candidate biomarkers selected by either gCSI or CCLE 
also produced extreme association statistics when compared to the other 
study's drug response data. For example, ERBB2 and CDH3 were among 
the genes selected by the elastic net when applied to gCSI response data 
for lapatinib; univariate analysis of CCLE response data produced false 
discovery rate (FDR)-corrected P values of 1.6 x 10~”? and 6.6 x 107 for 
these genes, placing them on the far left of the CCLE distribution. Use of 
discrete sensitive/resistant labels rather than continuous drug response 
produced qualitatively similar results (Extended Data Fig. 6). 


a gCSI elastic net, CCLE univariate 
Crizotinib Irinotecan 
Uni neg (FDR < 0.10): 24 ] Elastic net neg: 12 Uni neg (FDR < 0.10): 731 « Elastic net neg: 34 
@ Uni pos (FDR < 0.10): 23 = Elastic net pos: 6 m@ Uni pos (FDR < 0.10): 1,357 «= Elastic net pos: 41 
Erlotinib Paclitaxel 
Uni neg (FDR < 0.10): 6,299 Elastic net neg: 9 Uni neg (FDR < 0.10):994 Elastic net neg: 89 
@ Uni pos (FDR < 0.10): 4,126 « Elastic net pos: 0 m@ Uni pos (FDR < 0.10): 1,537 «= Elastic net pos: 128 
Lapatinib PD0325901 
Uni neg (FDR < 0.10): 3,004 « Elastic net neg: 14 Uni neg (FDR < 0.10): 1,175 «= Elastic net neg: 21 
@ Uni pos (FDR < 0.10): 2,671 = Elastic net pos: 4 @ Uni pos (FDR < 0.10): 1,127 = Elastic net pos: 16 
Figure 3 | Consistent biomarker scoring for gCSI and CCLE drug 
response data. The elastic net algorithm was applied to identify short lists 
of candidate biomarkers for drug response as assessed by either gCSI or 
CCLE mean viability values, using gCSI genomic data. (Equivalent results 
based on CCLE genomic data are shown in Extended Data Fig. 5b-d.) 
Supplementary Table 4 describes the number of cell lines available for each 
comparison. Extended Data Fig. 5a describes the recovery of biomarkers 
expected to associate with response for each of the six drugs tested by 
both gCSI and CCLE. a, Genomic features were ranked by direction and 
strength of their association with CCLE mean viability (from strongest 


Owing to the truncation of published GDSC mean viability data, gCSI 
could only be compared to GDSC on the basis of biomarker discovery 
using discrete response. Although gCSI and CCLE discretized response 
data permitted identification of known correlates of response for both 
specific and broadly active drugs when assessed on lines in common 
between gCSI and CCLE (Extended Data Fig. 6a), only the latter were 
recovered when the same analysis was run on lines in common between 
gCSI and GDSC (Extended Data Fig. 7a). Furthermore, for drugs with 
specific effect, the algorithm yielded no biomarkers at all for GDSC drug 
response data when run on lines in common between gCSI and GDSC. 
For ten additional drugs in common between only gCSI and GDSC, the 
elastic net performed better—both gCSI and GDSC drug response data 
were capable of producing elastic net hits that scored well in the other 
study for many drugs in this set (Extended Data Fig. 4a, b). 


Methodology contributes to discrepancy 

After demonstrating that independent data sets were capable of revealing 
common responsive and resistant lines and common sets of candidate bio- 
markers, we next characterized potential sources of systematic variation 
between the projects. Comparison of experimental methods highlighted 
three key differences: assay type, cell seeding density, and growth media. 
The specific choices of each project are described in Extended Data Table La. 


b CCLE elastic net, gCSI univariate 


Crizotinib lrinotecan 


Uni neg (FDR < 0.10):513 «Elastic net neg: 0 Uni neg (FDR < 0.10): 2,826 «= Elastic net neg: 2 
@ Uni pos (FDR < 0.10): 81 @ Elastic net pos: 0 @ Uni pos (FDR < 0.10): 6,135 Elastic net pos: 18 


Erlotinib 


Uni neg (FDR < 0.10): 3,230 — Elastic net neg: 13 
m@ Uni pos (FDR < 0.10): 2,555 = Elastic net pos: 0 


Paclitaxel 


Uni neg (FDR < 0.10): 2,376 a» Elastic net neg: 13 
@ Uni pos (FDR < 0.10): 4,109 Elastic net pos: 23 


PD0325901 


Uni neg (FDR < 0.10): 2,196 «= Elastic net neg: 18 
@ Uni pos (FDR < 0.10): 2,639 Elastic net pos: 11 


negative to strongest positive). Shaded regions indicate features with 
significant univariate association (FDR-adjusted P < 0.10). Blue indicates 
negative association with response and red indicates a positive association. 
The univariate ranks of features selected by the elastic net using gCSI drug 
response are indicated with vertical bars, also coloured by direction of 
association. In general, the gCSI-selected features are strongly enriched 

at the extreme ends of the distributions, indicating that they are highly 
relevant to both drug response data sets. b, As for panel a, but applying the 
elastic net to CCLE drug response data and comparing results to univariate 
statistics obtained from gCSI drug response. 


Lapatinib 
Uni neg (FDR < 0.10): 3,637 Elastic net neg: 8 
@ Uni pos (FDR < 0.10): 3,257 = Elastic net pos: 0 
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Figure 4 | Follow-up screen shows SYTO 60 assay closely agrees with 
CTG but is significantly less precise. a, Comparison of mean viability 
between STYO 60 and CTG assays for 24 lines and four combinations of 
growth media and seeding density. Results were largely concordant, with 
SK-N-DZ treated with paclitaxel as the only notable exception. Symbol 
colour indicates seeding strategy; symbol shape indicates media condition. 
Inset plots indicate the widths of inter-replicate 95% confidence intervals 
on mean viability for CTG (C) and SYTO 60 (S). See Extended Data Fig. 8 
for full details. b, Pairwise Pearson correlation of mean viability across 

24 cell lines, combining data from the two broadly active drugs, for 


We were particularly interested in the potential impact of the 
choice of assay for viability readout. Cytotoxic effects can be reason- 
ably assessed by either metabolic readout or live cell count, whereas 
cytostatic effects may produce distinct results in the two viability 
readouts®. For lapatinib and paclitaxel, for example, Haibe-Kains et al.° 
observed a better correlation between CCLE and a GlaxoSmithKline 
(GSK) study’ that also used CellTiter-Glo (CTG) than between 
GDSC and GSK. On this basis they asserted that inconsistencies 
between CCLE and GDSC were partly due to the use of different 
assays’. 

To evaluate the relevance of methodology differences, we under- 
took a follow-up screen that considered each of the variables in a full 
three-way factorial design. Twenty-four lines and four drugs common 
to all three studies were re-assessed with CTG versus the SYTO 60 
fluorescent stain, fixed versus variable seeding, and 5% versus 10% 
FBS (Methods). PD0325901 showed a slight but systematic reduction 
in viability when assessed by CTG (mean reduction of 3.6 percentage 
points relative to SYTO 60, paired t-test excluding SK-N-DZ cells, 
P=1.3 x 107°, Fig. 4a). Notably, for the other three drugs, a compari- 
son of CTG and SYTO 60 mean viability estimates showed no evidence 
of systematic bias (erlotinib, P= 0.77; lapatinib, P= 0.23; and pacl- 
itaxel, P=0.27). SK-N-DZ cells treated with paclitaxel were the only 
exception; however, since this line is only loosely adherent, it may have 
been subject to cell loss during a SYTO 60 washing step. Further, our 
SYTO 60 results for broadly active drugs (including PD0325901) were 
more consistent with CTG values from the primary gCSI (r= 0.58) and 
CCLE (r= 0.47) screens than with the GDSC SYTO 60 values (r= 0.42, 
Fig. 4b). Taken together, these data indicate that, at least for the drugs 
considered in the follow-up screen, biological bias arising from choice 
of viability readout is not a plausible explanation for the differences 
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which use of correlation coefficient is appropriate. Follow-up SYTO 60 
results were more consistent with CTG values from the primary gCSI and 
CCLE screens than with the GDSC SYTO 60 values (rectangle ‘1’). All 
eight follow-up screen results were best correlated with the gCSI primary 
screen, with the strongest correlation (rectangle ‘2, r= 0.74) being the 
same conditions used in the primary gCSI screen. Similarly, follow-up 
results that best correlated with CCLE data were those that matched CCLE 
screening conditions (rectangle °3, r= 0.63). CTG and SYTO 60 values are 
derived from three biological replicates (Methods). 


between CCLE and GDSC. Crucially, results from the SYTO 60 assay 
were significantly more variable than those from CTG. Confidence 
intervals for mean viability (Methods) were significantly wider for 
SYTO 60 than for CTG—for most cell lines, and for all drugs and 
across all media and seeding density conditions (Fig. 4a and Extended 
Data Fig. 8c). Combining these observations, we see that the SYTO 
60 assay typically captured the same information as CTG but it did so 
with far less precision. 

Unlike viability readout, other methodology choices did impact results 
more broadly, although typically with modest effect size. Extended Data 
Figure 8a shows that for some, although not all, drug and cell line com- 
binations, increased FBS had a consistent though weak effect on mean 
viability. For PD0325901 in particular, increasing FBS systematically 
increased mean viability (an average increase of 4 percentage points, 
paired t-test, n= 46, P= 2.3 x 10°, Extended Data Fig. 8b). For other 
drugs, however, no consistent shift of a practically relevant magnitude 
was observed (Extended Data Fig. 8b). 

Seeding density also produced systematics effects. When slow 
growing cell lines were seeded at higher density—allowing control 
wells to become confluent over the course of the assay, and thus 
potentially constraining growth or saturating CTG signal—mean via- 
bility tended to increase and apparent drug sensitivity was reduced. 
Conversely, fast growing cell lines tended to show lower mean viability 
and appeared to be more sensitive when seeded at lower density 
(Extended Data Fig. 9a). Intriguingly, a similar trend was seen when 
comparing gCSI (variable seeding) and CCLE (fixed seeding) mean 
viability data (Extended Data Fig. 9b), suggesting that a small but 
statistically significant portion of the difference between the two result 
sets can be explained by how those projects managed differential 
growth rates. 
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Discussion 

In their critique of the CCLE and GDSC projects, Haibe-Kains et al.’ 
wrote, “Because there is no clear concordance, predictive models of 
response developed using data from one study are almost guaranteed to 
fail when validated on data from another study, and there is no way with 
available data to determine which study is more accurate.” The authors 
of the original studies recently disputed the choice of statistical meth- 
ods used in the critique, and in suggesting alternatives, they highlighted 
the importance of biologically grounded analysis methods®. We strongly 
endorse this point of view. Further, we have presented an important, 
independent data resource and shown that when robust drug response 
statistics are used, and when concordance metrics are aligned with the 
stated purpose of high-throughput cell line drug response profiling, 
consistency can be achieved: consistent identification of cell lines that are 
sensitive or resistant to drug, as well as consistent identification of candidate 
biomarkers that predict response. 

Furthermore, the independently generated gCSI data address the 
question of which of the two original studies is more accurate: gCSI 
data agreed less well with GDSC than with CCLE. The GDSC trun- 
cation of viability statistics as well as lower dose ranges and limited 
sets of sensitive cell lines for some drugs undoubtedly contributed to 
this difference. We have also shown that, in our hands, SYTO 60 pro- 
duced significantly noisier drug response estimates than CTG, even 
while capturing essentially the same biology. Therefore, the GDSC 
choice of SYTO 60 probably further reduced concordance with gCSI 
(and CCLE). 

Finally, we have shown that other aspects of experimental methodology 
systematically affect measured cell viability. In particular, allowing cells to 
become over-confluent, by either over-seeding or using richer media, can 
lead to underestimation of inhibition. While these phenomena explain 
some fraction of the discrepancy between projects, other sources of both 
stochastic and unexplained systematic error undoubtedly remain. Our 
data demonstrate, however, that when appropriate laboratory and analysis 
protocols are applied, these are small enough to permit the generation of 
biomarker results that replicate in other settings, and thus, a realization of 
the potential of large cell line panels for characterization of heterogeneous 
response to anti-cancer agents and therapeutics. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Cell lines. Genentech maintains a collection of cancer cell lines obtained from a 
variety of academic and commercial sources (for example, ATCC, DSMZ)’. Cell 
line identity was verified by high-throughput single nucleotide polymorphism 
(SNP) genotyping using Illumina Golden gate multiplexed assays. SNP profiles 
were compared to SNP calls from internal and external databases to determine 
or confirm ancestry. In cases where data were unavailable or cell line ancestry 
was questionable, DNA or cell lines were re-purchased to enable confirmation 
of cell line ancestry. SNP genotypes were assessed each time new stocks were 
expanded from cryopreservation. SNP genotypes were also matched to our SNP 
microarrays (see below), which were subsequently matched to our RNA-Seq and 
exome sequencing data to confirm the matching of samples used for genomic and 
cell-based assays. 

Screening drugs management and quality control. Drugs were obtained from 
in-house synthesis or purchased from commercial vendors. A fully automated 
transfer system by Nova Technology (Innovate Engineering) was used to transfer 
material from a dry library, solubilize with DMSO, and log the solutions into our 
compound management system. A high throughput liquid chromatography mass 
spectrometry/ultraviolet absorbance/charged aerosol detector/chemiluminescent 
nitrogen detector (LCMS/UV/CAD/CLND) system was used to verify identity, 
purity, and concentration of drugs used in the gCSI screens. The LCMS/UV/ 
CAD/CLND system consisted of an LCMS/UV system (Shimadzu) with LC-30AD 
solvent pump, 2020 MS, Sil-30AC autosampler, SPD-M30A UV detector, and 
CTO-20A column oven; a Corona Veo RS CAD (Thermo Scientific); and a model 
8060 CLND (Antek)’°. Drugs with lower than 80% purity and 20% below expected 
concentration were excluded. 

An Echo 555 acoustic drop ejection (ADE) liquid handler (Labcyte) was fully 

integrated in the ultra-high-throughput screening uHTS system to dispense DMSO 
solubilized compounds'!. Nine-point dose-response curves at 1:3 dilution were 
generated using ADE as a means of transferring library compounds at ultra-low 
volume (in nanolitre scale) to achieve direct dilution of compounds. The uHTS 
system delivered assay-ready daughter plates at x 1,000 concentration. A DMSO 
backfill step was performed to achieve an equal volume of DMSO in each well. 
Assay ready drug plates were stored at —80°C until the day of compound addition 
and subjected to a single freeze thaw cycle. Use of ADE technology limited the final 
DMSO concentration in assay plates to 0.1%, which was shown to have a negligible 
effect on cell growth. 
Cell viability assays. Cells were maintained in RPMI-1640, 5% FBS (10% heat- 
inactivated FBS for suspension lines), and 2mM glutamine at 37°C and 5% CO, 
incubators. Cell lines were split at least once before use in cell-based assays. Before 
plating, adherent lines were trypsinized and suspension lines were re-suspended. 
Cells were assessed with a Vi-CELL Cell Viability Analyzer (Beckman Coulter); 
viability of at least 90% for adherent cell lines and 85% for suspensions was required 
for screening. Cells were plated at previously determined seeding densities. For 
each adherent cell line, the optimal seeding density intended to achieve 75% 
confluency at 96 h was assessed by Incucyte (Essen Bioscience). The optimal seeding 
for suspension and mixed suspension/adherent cell lines was 75% signalmax at 96h, 
as assessed by CellTiter-Glo. 

A Multidrop Combi Reagent Dispenser (Thermo Scientific) was used for plating 
2511 cell suspension into barcoded Falcon 384-well black clear bottom plates. Cell 
plates were placed in a 37°C humidified incubator maintained at 5% CO over- 
night. The next day, copies of the x 1,000 daughter plates were taken out to thaw 
at room temperature. Room temperature RPMI-1640 without FBS was added to 
yield x6 intermediate plates. Drug plates were incubated at room temperature for 
Lh. Five microlitres of compound addition was delivered to cell plates using an 
Oasis integrated Liquid Handling Robot Platform (Dynamic Devices) with Liconic 
Incubators and automated barcode tracking. After 72h, cell plates were taken out 
of the incubator and equilibrated to room temperature for 30 min. Twenty-five 
microlitres of CellTiter-Glo reagent (Promega) was added using a Biotek Multiflo 
Microplate Dispenser. Cell lysis was induced by mixing for 30 min on an orbital 
shaker; plates were then incubated at room temperature for 10 min to stabilize 
luminescent signal. It is important to note that many of the blood lines required 
additional time (1-1.5h) to induce cell lysis on the orbital shaker. Luminescence 
readout (relative light units; RLU) was performed on a 2104 EnVision Multilabel 
Plate Reader (PerkinElmer). 

Three to four independent biological replicates were produced. Same-plate tech- 
nical replication was observed to have negligible effect on outcome and therefore 
omitted. 

Cell line authentication and quality control. First, short tandem repeat (STR) 
profiles were determined for each line using the Promega PowerPlex 16 System. 


Sixteen loci (fifteen STRs plus Amelogenin for sex determination) were evaluated: 
D3S1358, THO1, D21S11, D18S51, Penta E, D5S818, D138317, D7S820, D16S539, 
CSF1PO, Penta D, AMEL, vWA, D8S1179 and TPOX. Results from gCSI cell lines 
were compared to external STR profiles of cell lines (when available) to confirm 
cell line ancestry. 

In addition to the initial STR assessment, SNP genotypes were re-checked using 
a Fluidigm multiplexed assays each time new stocks were expanded for cryopres- 
ervation. Resulting SNP profiles were compared to previously generated internal 
and external data (when available), to reconfirm ancestry. In cases where such 
data were unavailable or cell line ancestry was questionable, DNA or cell lines 
were re-purchased. SNPs selection for this purpose was based on minor allele 
frequency and presence on commercial genotyping platforms. The SNP assay 
included rs11746396, rs16928965, 1s2172614, rs10050093, rs 10828176, rs16888998, 
rs16999576, rs1912640, rs2355988, rs3125842, rs10018359, rs10410468, 
rs 10834627, rs11083145, rs11100847, rs11638893, rs12537, rs1956898, rs2069492, 
rs10740186, rs12486048, rs13032222, rs1635191, rs17174920, rs2590442, 
rs27 14679, rs2928432, rs2999156, rs10461909, rs11180435, rs1784232, rs3783412, 
rs 10885378, rs1726254, rs2391691, rs3739422, rs10108245, rs1425916, rs1325922, 
rs1709795, rs1934395, rs2280916, rs2563263, rs10755578, rs1529192, rs2927899, 
1s2848745 and rs10977980. 

All stocks were tested for mycoplasma before and after cells were cryopreserved. 
Two methods were used to minimize false positive or false negative results: Lonza 
Mycoalert kit and Stratagene Mycosensor. Cell growth rates and morphology were 
also monitored for any batch-to-batch changes. 

The cell lines MEL-HO, ONCO-DG-1 and KCI-MOH 1 were used in our study 
and are listed on the NCBI misidentified cell line list (http://www.ncbi.nlm.nih. 
gov/biosample). We chose to maintain them because their utility here depends not 
on their true identity, but on their common use among gCSI, GDSC and CCLE. 
Drug response statistics. gCSI data were processed using the R statistical com- 
puting environment (R Core Team, 2013) and a Genentech-developed analysis 
package, compareDrugScreens, which is available for download. 

To reduce noise and the impact of spatial effects, per-dose viability was com- 
puted relative to median RLU from untreated wells physically near drugged wells 
on the plate. To minimize the impact of isolated aberrant wells, a four-parameter 
log-logistic model relating viability to log-dose was fit by minimizing the mean 
of residual absolute value (as an alternative to standard least squares). Cases with 
a high degree of variability across biological replicates were manually flagged 
and excluded from further analysis. In some cases, grossly aberrant runs were 
also dropped, resulting in fewer replicates. Fitted viability statistics for retained 
biological replicates were integrated via maximum likelihood and a hierarchical 
model that accounted for both within-run noise and cross-run biological varia- 
bility. Specifically, fitted values and their standard errors for the final, synthesized 
curve are estimated on a dose by dose basis, and on the log-viability scale. For a 
given dose, the true viability value for run i is assumed to be subject to biological 
variation, and is thus modelled as a Gaussian random variable ju; N(,1,7°). Given 
ju; the viability estimate for run iis modelled as v; ~ N(,u;,;0/7). In practice, noise 
about fitted curve clearly varies from run to run, so use of run specific 0/7 is impor- 
tant. Single-curve fitted viability (v;) and its associated estimated standard error (s;) 
provide natural plug-in estimates of ju; and o;. These are treated as fixed, and then 
[io and 7 are estimated by numerically optimized maximum likelihood. (See the 
estMLE.R file in the compareDrugScreens package for additional detail.) To avoid 
confidence intervals which fall below 0%, standard symmetric confidence intervals 
are computed on the log-viability scale and then endpoints are exponentiated. 
Note that because, unconditionally, vj + N({1o, 7 +07), this meta-analysis style 
approach induces weighted averaging of runs, with weight inversely proportional 
to within-run noise level. 

The reported ICs9 is the dose at which cross-run estimated inhibition is 50% 
relative to untreated wells, that is, absolute ICs9. The mean viability statistic is the 
arithmetic average of the fitted viabilities at each tested dose. Mean viability is 
equivalent to the area under the log-dose/viability curve divided by the total num- 
ber of tested doses, and is thus on an interpretable percentage scale. Because the 
mean viability statistic depends on dosing, it is not expected to produce identical 
quantitative values for projects using different dosing schedule; it should, however, 
produce correlated results and order responsive lines similarly. 

Published CCLE viability statistics differed from those used by gCSI and GDSC 
in an important respect. CCLE first profiled all cell lines with a high dose of MG132 
(a potent cell-permeable proteasome inhibitor). They then defined the minimum 
attainable viability for a given cell line (nominal ‘0’) as the viability observed upon 
treatment of that line with MG132. gCSI and GDSC, on the other hand, took the 
standard approach of equating nominal ‘0’ and true 0—that is, no observed fluo- 
rescence. Absolute and MG132-relative viability and ICso statistics are not directly 
comparable. To enable comparison, we reversed the MG132 adjustment made 
by CCLE and then refit dose response curves to the CCLE data using the gCSI 
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algorithms. (Both the code necessary for the results of this process are provided 
in the compareDrugScreens package.) All CCLE mean viability and ICs statistics 
referenced in this manuscript are therefore on absolute rather than MG132-relative 
scale (Supplementary Table 2). GDSC did not provide per-dose viabilities, so their 
data could not be reprocessed in this fashion. 

Additional investigation of published CCLE dose-response curves identified 
a small fraction with erroneous reported ICs9 values; typically, those with per- 
dose viability values that were either all above or all below the 50% threshold. The 
recalculation of absolute viabilities described above also corrected this problem. 
Assigning sensitive and resistant labels. Specifically active targeted agents were 
assumed to have no effect on most cell lines, but to inhibit a small fraction of lines 
dependent on the relevant pathways, to varying degrees. To model this behav- 
iour, two component Gaussian mixture distributions were fit to untruncated gCSI 
and CCLE mean viability data for crizotinib, erlotinib and lapatinib, using mclust 
(http://bioconductor.org). Although Extended Data Fig. 2a gives bivariate fits for 
illustration purposes, final sensitive/resistant thresholds were assigned one study 
at a time. Variance for the two components was not required to be equal. Cell lines 
more than two standard deviations below the fitted resistant group mean were 
called sensitive. The fitted sensitive group was more dispersed since, unlike the 
resistant group, variation here reflects both assay noise and true variation in degree 
of sensitivity to the drug; to accommodate this, lines more than one standard 
deviation above the fitted sensitive group mean were called resistant. Lines that 
did not meet either of these criteria we assigned no-call. Extreme outliers (>1.05 
for gCSI and >1.10 for CCLE) were excluded from fitting, but not classification, 
for crizotinib, erlotinib and lapatinib as the two-component fit is susceptible to 
assigning these rare outliers their own distribution. 

This strategy could not be applied to the GDSC mean viability results owing 
to truncation of published statistics, which precludes the use of Gaussian distri- 
butions. Instead, we used the same cutoffs for gCSI as computed above, counted 
lines called sensitive or resistant by gCSI among those in the gCSI/GDSC overlap 
set, and then assigned the same number of sensitive or resistant labels to those cell 
lines with most extreme GDSC mean viability scores. GDSC truncation of mean 
viability values at 1 often resulted in ties; these were resolved at random in order 
to select an appropriate number of resistant lines. 

Broadly active agents were assumed to affect most or all lines, although to widely 
varying degrees. This makes use of mixture distributions inappropriate. For these 
compounds, we simply split cell lines into three even groups and assigned sensitive, 
no-call and resistant labels. 

When assigning labels for data from a given institution, all available lines from 
that institution were used, even those not screened by other groups. 

Expected biomarkers and expected responder cell lines. For Fig. 2 and 
related analyses, we selected sets of cell lines (termed ‘expected responders’) 
with the clearest evidence of genomic features previously shown to correlate 
with response to each of six drugs. These lines were selected using the gCSI 
feature-standardized genomic data matrix, so all cutoffs for quantitative vari- 
ables are in standard (z-score) units. For crizotinib, the following features and 
cutoffs were used: MET amplification (copy number z-score >3.0) or evidence 
of EML4/ALK translocation. Further, the cell line OE33 lacked copy number 
data but is MET amplified; the cell line NB-1 expresses an AML4/ALK fusion 
transcript’? and SR-786 has a NPM1/ALK fusion!>. Both were included as expected 
responders to crizotinib. For erlotinib, we selected lines with EGFR hotspot muta- 
tion (Supplementary Table 6) or high EGFR expression; for irinotecan, lines with 
high SLFN11 expression’"; for lapatinib, lines with EGFR hotspot mutation, high 
EGER expression or high ERBB2 expression. The biomarker for paclitaxel was low 
ABCBI expression». Finally, the biomarkers for PD0325901 were a positive value 
for the MAPK metafeature or high SPRY2 expression. Gene expression cutoffs 
were selected based on inspection of expression density plots (not shown). A cutoff 
of 2 was used to isolate the upper tail of unimodal distributions (EGFR, ERBB2, 
SPRY2), while a cutoff was placed in the gap for bimodal distributions (—1 for 
ABCBI1, 0 for SLFN11). 

Selecting lines for follow-up screening. Twenty-four lines were selected for the 
factorial assessment of viability assay, growth media, and cell seeding strategy. 
Because the SYTO 60 protocol is challenging to implement with cells in suspension’, 
we restricted attention to adherent lines. Twelve specific features of the 177 
cell lines available in all three studies were ranked. These features quantified 
differences in mean viability or ICs) among the three projects, availability of 
response data for cases where many lines were missing values, and relative seeding 
density in the gCSI primary screen. Cell lines with the highest average rank were 
selected. 

Follow-up screen. Cells were maintained in RPMI-1640, 5% FBS and 2mM glu- 
tamine. Cells were assessed with a Vi-CELL Cell Viability Analyzer (Beckman 
Coulter); viability of at least 90% for all cell lines was required for screening. Cells 
were plated at optimal seeding density to a previously determined, line-specific 
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level intended to achieve 75% confluency at 96h using an Incucyte (Essen 
Bioscience). In addition, the same cell lines were also plated at a fix seeding 
density of 1,020 cells per well. To address whether serum concentration had an 
effect on drug response, cells were plated at 5% and 10% respectively. Multidrop 
Combi Reagent Dispenser (Thermo Scientific) was used for plating cells into 
barcoded Falcon 384-well black clear bottom plates. Cell plates were placed in 
a 37°C humidified incubator maintained at 5% CO) overnight. A 9-point dose 
dilution was performed using an Echo 550 (Labcyte). The next day, compound 
addition was done using a Bravo Automated Liquid Handling Platform (Agilent 
Technologies). After 72h, 25\1l CTG reagent (Promega, G7573) was added using 
a Biotek Multiflo Microplate Dispenser. Cell lysis was induced by mixing for 2 min 
on an orbital shaker; plates were then incubated at room temperature for 10 min 
to stabilize luminescent signal. Luminescence readout (RLU) was measured on a 
2104 EnVision Multilabel Plate Reader (PerkinElmer). An identical set of assay 
plates was performed using SYTO 60 Red Fluorescent Nucleic Acid Stain method 
(Life Technologies, $11342) to compare different types of endpoint readout. Cells 
were fixed for 30 min using 4% formaldehyde prepared in PBS. Formaldehyde was 
aspirated and plates were washed 3 times using automatic liquid handling. Cell 
nuclei were stained using the SYTO 60 at 1 uM, 50,1 per well (384-well plate). 
Plates were incubated in the dark for 1 h at room temperature. Dye was aspirated 
and plates were washed three times using automatic liquid handling. Fluorescence 
intensity is measured (plate bottom measurement) by EnVision at 652 nm/678 nm. 
Three independent biological replicates were produced; same-plate technical rep- 
lication was observed to have negligible effect on outcome and therefore omitted. 
Gene expression. Expression levels were quantified by RNASeq via Illumina 
sequencing (75-bp paired-end reads) in one sample per cell line. Reads were filtered 
for quality and for rRNA contamination. These reads were aligned to the genome 
(GRCh37.1) using GSNAP (version 2013-03-31)'* with the following options: -M 2 -n 
10 -B 2 -il -N 1 -w 200000 -E 1-pairmax-rna = 200000 -clip-overlap. An average of 
51.3 million uniquely aligned reads with concordant read pairs were available per 
sample. Read counts for each transcript were processed with the DESeq variance 
stabilizing transform (VST) method” to obtain normalized expression levels. The 
resulting expression measurements are referred to as “VST values. 
DNA copy number and loss of heterozygosity. Illumina Human Omni 2.5-4 or 
Omni 2.5-8 arrays, one per cell line, were processed with a modified version of the 
PICNIC (ref. 18) algorithm, as published recently’’. In short, the modifications 
to PICNIC increase the accuracy of global ploidy (and normal cell contamina- 
tion) estimation and make the algorithm compatible with Illumina arrays. These 
modifications also allow for accurate total copy number estimates and dramat- 
ically reduce over-segmentation. After PICNIC determines the sample ploidy 
and performs segmentation with its hidden Markov model (HMM) to determine 
allele-specific copy number, we added a final segmentation with cghFLasso~”. This 
addition accounts for the over-segmentation of PICNIC in regions where point 
copy number estimates indicate non-integer total copy number. This segmented 
total copy number was averaged for each gene using all SNP intervals that include 
a portion of that gene. We used the absolute copy number scale (where 2 is normal 
for an autosome) rather than the traditional relative log, absolute/ploidy copy 
number. Loss of heterozygosity (LOH) was assessed using the lesser of the two 
allele-specific copy number values (LACN), from the PICNIC HMM, using the 
same SNPs. Genes with an average LACN value <1 were considered to have LOH. 
Absolute total copy number values and binarized LOH values were included in the 
model for every gene. 
Mutations. DNA mutations were assessed by Illumina exome sequencing (75-bp 
paired-end reads) in one sample per cell line (manuscript in preparation). GSNAP 
(version 2013-03-31)'° was used to align reads to the genome (GRCh37.1) with the 
following options: -M 2 -n 10 -B 2 -i 1-pairmax-dna = 1000-terminal-threshold = 
1000-gmap-mode = none-clip-overlap. Duplicate alignments were filtered using 
PICARD tools (http://broadinstitute.github.io/picard). An average of 64.7 mil- 
lion uniquely aligned reads with concordant read pairs were available per sample. 
These aligned reads were used to assess genomic SNV and indel variants using the 
VariantTools (http://bioconductor.org) package (version 1.4.5) and the following cut- 
offs: a minimum of 2 reads supporting the variant and at least 5% variant frequency. 

We assessed our cell lines for mutations in 235 hotspot codons in specific RefSeq 
transcripts for 68 genes (Supplementary Table 6). As mutation consequences are 
evaluated with respect to a specific transcript, we ensured that the selected tran- 
scripts indicated the expected wild type amino acids at the hotspot codons. Cell 
lines with any missense mutation within a hotspot codon were counted as mutant 
for that hotspot. A cell line lacking a variant at a given hotspot codon was counted 
as wild type if all three bases in the codon had sufficient coverage to detect a 
variant, with 80% power, given the VariantTools cutoffs above, and otherwise was 
considered a no-call (NA). 

Hotspots were also combined, by gene, such that a cell line with any hotspot 
mutation in a given gene was considered positive for that hotspot metafeature. 
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For example, a cell line with a PIK3CA missense mutation at amino acid 542, 545 
or 1047 was considered positive for the PIK3CA hotspot metafeature. 

To address tumour suppressor genes or previously uncharacterized muta- 
tions in oncogenes, three levels of mutations were considered for each coding 
position in each gene. First, mutation consequences were annotated using the 
Ensembl Variant Effect Predictor (VEP)*!. VEP implements a controlled vocab- 
ulary, and VEP vocabulary terms in the following set were considered to always 
be function altering mutations (and coded as mut): TRANSCRIPT_ABLATION, 
ESSENTIAL_SPLICE_SITE, STOP_GAINED, FRAMESHIFT_CODING, STOP_ 
LOST. Missense mutations that had previously been observed one or more times 
in patient tumours (COSMIC, version v63_300113)”” were also considered to be 
function-altering mutations (mut). Second, a larger mutp class was formed by com- 
bining mut variants with non-COSMIC missense variants assessed as ‘damaging’ 
by the Condel tool*’. Finally, variants that met the mutp criteria and were also 
within a segment of LOH were coded as mutpl. For each of these three categories 
of mutation, a sample was called wild type for a given gene if it had no variants and 
also had sufficient coverage at >50% of coding nucleotides; genes with no variants 
but insufficient coverage were considered NA. 

Finally, we also formed three pathway metafeatures by combining mutations. 

The PI3K metafeature was positive for lines with a PIK3CA hotspot mutation, 
PTEN LOH, or PTEN loss. The DNA damage metafeature was positive for any cell 
line with a mutp variant in MLH1, MLH3, KMT2D (also known as MLL2), MSH2, 
MSH3, MSH6, PMS1 or PMS2. The MAPK metafeature was positive for any cell 
line with a hotspot mutation in BRAF, HRAS, KRAS, or NRAS. 
GDSC and CCLE data. For CCLE, DNA copy number and expression were down- 
loaded from GEO (accession number GSE36139). Mutation and cell line descrip- 
tion data were downloaded from the CCLE website (http://www.broadinstitute. 
org/ccle/home). Drug and cell line names were matched across data sets. (Note 
that the cell line “I:T’ was renamed ‘tdott’ to differentiate from cell line “TT’.) As 
described above, the CCLE ICs) and mean viability values were re-calculated using 
the provided per-dose viability values from ref. 24 (see Supplementary Table 2). 

The GDSC mean viability and ICs» values (gdsc_manova_output_w2.csv) were 

downloaded from ftp://ftp.sanger.ac.uk/pub4/cancerrxgene/releases/release-2.0 
as in ref. 3. Note that GDSC mean viability statistics were not included in the 
original publication. Because only summary statistics were provided, GDSC mean 
viability values were used as is. The GDSC extrapolated ICs values, on the other 
hand, were truncated to the limits of the tested dose range for consistency with 
gCSI and CCLE data. 
Predictive modelling. In preparation for predictive modelling, several cell line 
features were gathered: tissue of origin; growth rate as assessed by seeding density 
(the number of cells per assay control well required to reach 75% confluency at 
72h); and doubling time. 

All sample descriptors were combined into one matrix for predictive modelling. 
Expression VST values, DNA copy number, seeding density and growth rate were 
expressed as continuous values. Mutation status for individual hotspots, for gene 
hotspot and pathway metafeatures, and for the three categories of function-altering 
mutation were each individually coded as binary variables. Categorical variables, 
such as tissue type, were coded with one binary dummy variable per factor level. 
For example, the value ‘1’ for the ‘isLung’ feature indicates that a cell line is derived 
from lung cancer. All features, including the binary features, were standardized 
before imputation and predictive modelling. 

Missing data, owing to a lack of genomic data or insufficient sequencing cov- 
erage, were imputed using glmnet*. Missing values were initially replaced with 
the average value for their column (feature). Missing data were then imputed, one 
feature at a time, in a serial fashion, by treating one column as the response variable 
and all other feature columns as dependent variables. Training was performed 
using all samples with data for the response variable and values were then predicted 
for the samples previously missing data. 

Classification and feature selection were performed using glmnet* with bina- 
rized or continuous valued mean viability as the response variable. We selected an 
a value of 0.5 to increase the richness of the selected features and to increase inter- 
pretability. We selected the minimum lambda value within one standard deviation 
of the absolute minimum value under cross validation. This cross validation was 
performed with 25 randomly selected hold-out groups consisting of 20% of the 
samples. Hold-out groups were selected using the ‘caret’ (http://bioconductor.org) 
tool to select sample groups with sensitive/resistant proportions or continuous 
response variable distributions similar to that of the full set of samples. Following 
cross validation and lambda selection, a glmnet run with all samples was performed 


to select potential biomarkers. All features with a non-zero coefficient under the 
selected lambda were used for further analyses. 

Owing to large differences in genomic feature naming and availability, it was 

necessary to use only one genomics data set at time when scoring genomic fea- 
tures for correlation with drug response and selecting candidate biomarkers. gCSI 
and CCLE mean viability data were each compared to the gCSI genomics data 
(Fig. 3 and Extended Data Fig. 5a), and then separately to the CCLE genomics 
data (Extended Data Figs 5b-d). A similar analysis was performed for gCSI ver- 
sus GDSC discretized mean viability statistics, relative to gCSI genomics data 
(Extended Data Fig. 7). Owing to sporadically missing drug response data and 
differences in the sets of lines for which gCSI and CCLE produced genomics data, 
the number of cell lines used in each comparison varied (Supplementary Table 4). 
For Fig. 3 and Extended Data Figs 5 and 6, only cell lines with gCSI and CCLE drug 
response data and genomic data were used. For Extended Data Fig. 7, only samples 
available in the gCSI and GDSC drug data and gCSI genomics data were used. 
Univariate analysis of drug response and genomics. Drug response data and 
standardized genomics data used for predictive modelling (above) were also ana- 
lysed one genomic feature at a time. Continuous drug response data was regressed 
on each genomic feature using standard least squares. Logistic regression was used 
for discretized drug response data. Correction for multiple testing was performed 
using the Benjamini and Hochberg method as implemented in the R package 
multtest (http://bioconductor.org). 
Code and data availability. All source code used in this study has been made 
available in the R computer language, in a fully documented software and data 
package: compareDrugScreens. This package is freely available under the Creative 
Commons 3.0 license and can be downloaded from http://research-pub.gene.com/ 
gCSI-cellline-data. All gCSI, CCLE and GDSC drug and genomic data used in this 
study are also included in this package. 
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Extended Data Figure 1 | Comparison of gCSI, CCLE and GDSC mean 
viability response statistics: all pairwise comparisons. a, b, [C59 values 
(the drug concentration estimated to produce a 50% reduction in viability 
relative to untreated cells) for the cell lines in Fig. 1. Because extrapolation 
beyond tested doses can produce statistically unstable results (Extended 
Data Fig. 2b), ICso values are truncated to the tested dose range (which 
varies by drug and project). c, d, Mean viability (equivalent to area under 


the log-dose/response curve) is plotted for all drugs that gCSI had in 
common with one or both other studies. 0% corresponds to complete 
growth inhibition at all doses; 100% corresponds to viability equal, on 
average, to untreated control wells. Values are capped at a maximum value 
of 100% for consistency with GDSC published data. gCSI mean viability 
and ICs» values were derived from three biological replicates (Methods). 
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Extended Data Figure 2 | Assignment of sensitive and resistant labels 
for specifically active agents. a~c, Untruncated mean viability statistics 
from CCLE and gCSJ are plotted for three drugs with specific activity. 
Two-component Gaussian mixture model fits to these values were 
performed to classify non-responder and responder distributions for 
CCLE and gCSI separately”®. Non-responder mean viability statistics 
were scattered around 100%, providing a good indication of assay noise 
levels. Point colour indicates hard sensitive (red) versus resistant (blue) 
thresholding based on likelihood ratio—a strategy that ignores uncertainty 
of classification. Dotted lines, on the other hand, indicate the final 
thresholds selected by the method used for Fig. 2; this method quantifies 
uncertainty and, if it is too high, assigns a no-call label to the cell line 
(Methods). Ellipses indicate the covariance from a two-dimensional, 
two-component fit and are for illustrative purposes only; response labels 
were assigned on the basis of individual fits for gCSI and CCLE separately, 
using all available cell lines for each data set. d-f, Hypothetical log-dose/ 
viability curves illustrating statistical challenges presented by ICs 
response metric. In small screens, it is often feasible to adjust dosing scale 
so that a 50% inhibition (relative to vehicle-treated wells at the same time 
point) is cleanly achieved. For large-scale screening, on the other hand, it 
is common for many cell lines to fail to achieve this level of inhibition at 
any tested dose, and/or to fail to demonstrate a lower asymptote in their 
dose-response curve. The mean viability statistic, on the other hand, 
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requires no extrapolation beyond tested doses. d, A hypothetical 
dose-response curve with characteristics ideal for relative or absolute 
ICs computation. Tested doses yield viability values well above and 

well below 50%. Furthermore, both upper and lower asymptotes can be 
precisely estimated, enabling use of relative ICs0, if desired (relative ICs9 is 
computed after first rescaling to the observed viability dynamic range. 

In the present work, however, all three studies use absolute ICs9). 

e, A hypothetical dose-response curve that yields precise upper and lower 
asymptote estimates but a highly imprecise absolute ICs) estimate: doses 
over a range exceeding two orders of magnitude are all consistent with 
50% inhibition, and raw data are too noisy for further refinement. Use of 
such ICs9 values, which can be frequently observed in gCSI, CCLE and 
GDSC results, may complicate cross-study comparisons. f, A hypothetical 
dose-response curve that requires potentially imprecise extrapolation 

for estimation of both absolute ICs) and lower asymptote. We advise 
against use of either absolute or relative ICs) for such curves, which are 
again common in all three studies under consideration. Finally, also note 
that in settings such as this, where low doses provide no information and 
inhibition occurs only at the highest tested doses, any response statistic is 
forced to rank cell lines on the basis of just a subset of the data generated. 
As a consequence, results for such drugs may exhibit reduced concordance 
regardless of analysis method. 
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Extended Data Figure 3 | Comparison of gCSI, CCLE and GDSC 
primary drug response statistics: only cell lines assayed by all three 
projects. Equivalent to Fig. 1, but restricted to cell lines in the three-way 
gCSI, CCLE and GDSC intersection. Figure 1 shows a significant number 
of cell lines interrogated by gCSI and CCLE that are sensitive to the three 
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specifically active drugs. Of these, few (if any) are visible here, because 
few were included in the gCSI-GDSC intersection. a-d, Mean viability 
statistics (a, b), and ICso (c, d). Both are truncated as for Fig. 1 and 
Extended Data Fig. 1. gCSI mean viability and ICs values were derived 
from three biological replicates (Methods). 
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C Agreement rate for doubly-called cell lines screened only by gCSI and GDSC 


GDSC 


Agree 
Called (%) Sensitive Resistant Disagree Agree (%) 


specific bid1870 30% 13 40 23 70% 7.7E-03 

broad bortezomib 52% 13 13 22 54% 0.57 

broad docetaxel 48% 46 49 25 79% 1.6E-10 

broad doxorubicin 47% 57 53 26 81% 9.6E-14 

specific gdc0941 47% 3 103 5 95% 8.6E-04 

broad gemcitabine 44% 42 59 24 81% 4.2E-12 

specific ms275 62% 2 33 21 63% 0.71 

specific rapamycin 44% 9 26 7 83% 1.8E-04 

broad thapsigargin 46% 41 56 36 73% 2.2E-07 

broad vorinostat 50% 45 51 26 79% 2.2E-10 
Extended Data Figure 4 | gCSI-GDSC concordance for drugs common response are indicated with vertical bars, also coloured by direction of 
to only gCSI and GDSC. a, Genomic features were ranked by direction association. b, As for panel a, but applying the elastic net to GDSC drug 
and strength of their association with GDSC mean viability (from response data and comparing results to univariate statistics obtained from 
strongest negative to strongest positive). Shaded regions indicate features gCSI drug response. c, As in Table 1, lines called sensitive or resistant by 
with significant univariate association (FDR-adjusted P < 0.10). The both gCSI and GDSC were assessed for concordant classification. 
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A Features identified by elastic net for g¢SI and CCLE mean viability data (gCSI genomics) 


Selected positive control features Total features 


Target Expected Biomarkers gCSI CCLE gCSI CCLE Shared Genes 


MET, HGF expression, 


crizotinib ALK, MET MET copy number ST7-AS1 18 0 
EGFR mutation, CDH3, EGFR, ITGB6, 
erlotinib EGFR expression, or copy EGFR EGFR 9 13. LOC100507500, 
number LOC100652762 
EGFR or ERBB2 copy 
lapatinib ERBB2 number or expression, ERBB2 ERBB2 18 8 ERBB2, MIR4728 
EGFR mutation 
irinotecan Topoisomerase SLFN11 expression SLFN11 SLFN11 75 20 SLFN11, SQSTM1 
ABCB1, ARF4, BCL2L1, 
paclitaxel Tubulin ABCB1 expression ABCB1 ABCB1 217 36 Pilon LOCA Nad ese: 


OR6A2, SLC35F5, SSRP1, 
TRPC4AP 

DNAJA4, DUSP6, ERC2, 
MAPK metafeature, 37 29 GUF1, HSD17B11, HSD17B6, 
SPRY2, DUSP6 MAPK metafeature, RNF125, 
SPRY2, TMEM63C 


SPRY, DUSP, ETV4 SPRY2, MAPK 
pd0325901 MEK expression, RAS/RAF __metafeature, ETV4, 
mutation DUSP6 


b Features identified by elastic net for gCSI and CCLE mean viability data (CCLE genomics) 


Selected positive control features Total features 


Target Expected Biomarkers gCsl CCLE gCSI CCLE Shared Genes 


MET, HGF expression, 


crizotinib ALK, MET MET copy nunibel 0) 0) 
EGFR mutation, 
erlotinib EGFR expression, or copy EGFR EGFR 8 8 CDH3, EGFR, KLF5, MAL2 
number 
EGFR or ERBB2 copy 
lapatinib ERBB2 number or expression, ERBB2 ERBB2 18 10 ERBB2, KLF5, MAL2, RHOV 
EGFR mutation 
irinotecan Topoisomerase SLFN11 expression SLFN11 SLFN11 66 16 ATP6VOE1, SLFN11 
' ; ; ABCB1, PPP1R9A, SSRP1, 
paclitaxel Tubulin ABCB1 expression ABCB1 ABCB1 35 25 TMEM139 
BRAF, CCDC107, ETV4, 
teen IVSe Seinen ree SPRY2, ETV4,BRAF, SPRY2,ETV4,DUSP6, 4, HIVEP1, HSD17B6, LTBP1, 
P pie ; KRAS BRAF MAGEE1, NPAS2, RNF125, 
¥ SFXN3, SPRY2, SSFA2 
Cc gCSI Elastic Net, CCLE Univariate d CCLE Elastic Net, gCSI Univariate 
crizotinib irinotecan crizotinib irinotecan 
Uni neg (FDR < 0.10):123 «@ Elastic net neg: 0 ©) Unineg (FDR < 0.10): 1354 « Elastic net neg: 39 © Uni neg (FDR < 0.10): 246 «@ Elastic net neg: 0 Uni neg (FDR < 0.10): 3391 «@ Elastic net neg: 8 
™ Uni pos (FDR < 0.10): 45 @ Elastic net pos: 0 ™ Uni pos (FDR < 0.10): 801 = Elastic net pos: 27 ™ Uni pos (FDR < 0.10): 50 @ Elastic net pos: 0 Uni pos (FDR < 0.10): 2261 «= Elastic net pos: 8 
erlotinib paclitaxel erlotinib paclitaxel 
© Unineg (FDR < 0.10): 2325 « Elastic net neg: 8 ©) Unineg (FDR < 0.10): 2468 «= Elastic net neg: 20 ©) Unineg (FDR <0.10): 1860 = Elastic net neg: 8 © Unineg (FDR < 0.10): 3923 «= Elastic net neg: 15 
™  Unipos (FDR <0.10): 3640 «= Elastic net pos: 0 ™  Unipos (FDR <0.10):1509 «= Elastic net pos: 15 a Uni pos (FDR < 0.10): 2853 «= Elastic net pos: 0 ™  Unipos (FDR <0.10): 2209 «= Elastic net pos: 10 
lapatinib pd0325901 lapatinib pd0325901 
©) Unineg (FDR < 0.10): 2138 Elastic net neg: 13 " Unineg (FDR < 0.10): 827 @ Elastic net neg: 46 © Unineg (FDR < 0.10): 2239 «= Elastic net neg: 9 ©) Unineg (FDR < 0.10): 1454 « Elastic net neg: 28 
™ Uni pos (FDR <0.10):3100 «= Elastic net pos: 5 ™ Uni pos (FDR < 0.10): 808 Elastic net pos: 26 ™ Uni pos (FDR < 0.10): 3754 Elastic net pos: 1 ™ — Unipos (FDR <0.10):2119 Elastic net pos: 13 
Extended Data Figure 5 | Consistent biomarker scoring for gCSI and gCSI and CCLE selected feature sets. Asterisk denotes that ST7-AS1 is 
CCLE drug response data. The elastic net algorithm was applied to adjacent to MET and ST7-AS1 expression correlates with MET copy 
identify short lists of candidate biomarkers for drug response as assessed number (Pearson correlation r= 0.47, P< 2.2 x 107!°, n=534). The gCSI 
by either gCSI or CCLE mean viability values, using gCSI genomic data. genomics set does not include translocations, so ALK translocation could 
Supplementary Table 4 describes the number of cell lines available for not be checked directly. b, Equivalent to panel a, but using CCLE rather 
each comparison. a, Description of biomarkers expected to associate with than gCSI genomics data. c, d, Equivalent to Fig. 3a, b, but using CCLE 


response for each of the six drugs tested by both gCSI and CCLE; recovery __ rather than gCSI genomics data. 
of expected biomarkers; total selected feature count; and overlap between 
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@_ Features identified by elastic net for gCSI and CCLE discretized mean viability data (gCSI genomics) 
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Expected Biomarkers gCSl 
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P nate : SPRY2, ETV4, DUSP6 SPRY2, DUSP6 MAPK metafeature, RNF125, 
SH38TC2, SPRY2, ZCCHC9 
b gCSI Elastic Net, CCLE Univariate c CCLE Elastic Net, gCSI Univariate 
crizotinib irinotecan crizotinib irinotecan 
™ Unineg (FDR <0.10):0 = Elastic net neg: 0 | Unineg (FDR <0.10):0 = Elastic net neg: 37 | Uni neg (FDR <0.10): 1 = Elastic net neg: 0 Uni neg (FDR < 0.10): 2332 = _ Elastic net neg: 15 
™  Unipos(FDR<0.10):0 = _ Elastic net pos: 0 ™  Unipos(FDR<0.10):0 = _ Elastic net pos: 35 ™  Unipos(FDR<0.10):305 = _ Elastic net pos: 1 ™ ~~ Unipos (FDR <0.10):1120  — Elastic net pos: 9 
erlotinib paclitaxel erlotinib paclitaxel 
~~ Unineg (FDR <0.10):1709 = Elastic net neg: 2 Uni neg (FDR <0.10): 1425 = Elastic net neg: 49 ~~ Unineg (FDR <0.10):1375 = Elastic net neg: 1 ~~ Uni neg (FDR <0.10): 2483 = Elastic net neg: 26 
™ Uni pos (FDR <0.10):3315 = Elastic net pos: 17 ™ Uni pos (FDR < 0.10): 384 = Elastic net pos: 29 ™ ~~ Unipos (FDR <0.10):1981 = Elastic net pos: 28 © ~~ Unipos (FDR <0.10):1110 = Elastic net pos: 13 
y j = 
i 
lapatinib pd0325901 lapatinib pd0325901 
™  Unineg (FDR<0.10):109 = Elastic net neg: 4 ™  Unineg (FDR <0.10):601 = Elastic net neg: 27 © Unineg (FDR <0.10):909 = _ Elastic net neg: 4 Uni neg (FDR < 0.10): 1840 =< _ Elastic net neg: 52 
™ ~~ Unipos (FDR <0.10):578 = _ Elastic net pos: 27 ™ ~~ Unipos (FDR <0.10):893 = _ Elastic net pos: 19 ™ ~~ Unipos (FDR <0.10):1654 = Elastic net pos: 33 ™ ~~ Unipos (FDR <0.10):1746 = Elastic net pos: 44 


Extended Data Figure 6 | Features identified as response biomarkers 
using gCSI and CCLE discretized mean viability drug response data 
versus gCSI genomics data. Equivalent to Fig. 3 and Extended Data Fig. 5a, 
but using discretized sensitive/resistant labels rather than continuous 
drug response statistics (lines assigned a no-call label were ignored for 
this analysis). Recovery of expected biomarkers and scoring of features 
selected by elastic net in the opposing project’s univariate analysis are both 
comparable to results obtained from continuous mean viability statistics 
(Fig. 3 and Extended Data Fig. 5a). a, Description of biomarkers expected 
to associate with response for each of the six drugs tested by both gCSI 
and CCLE; recovery of expected biomarkers; total selected feature count; 
and overlap between gCSI and CCLE selected feature sets. b, Genomic 


| 


i 


features were ranked by direction and strength of their association with 
CCLE mean viability (from strongest negative to strongest positive). 
Shaded regions indicate features with significant univariate association 
(FDR-adjusted P < 0.10). Blue indicates negative association with response 
and red indicates a positive association. The univariate ranks of features 
selected by the elastic net using gCSI drug response are indicated with 
vertical bars, also colored by direction of association. In general, the 
gCSI-selected features are strongly enriched at the extreme ends of 

the distributions, indicating that they are highly relevant to both drug 
response data sets. c, As for panel b, but applying the elastic net to CCLE 
drug response data and comparing results to univariate statistics obtained 
from gCSI drug response. 
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Extended Data Figure 7 | Features identified as response biomarkers 
using gCSI and GDSC discretized mean viability drug response 

data versus gCSI genomics data. Equivalent to Extended Data Fig. 6, 
but contrasting gCSI with GDSC rather than CCLE. a, Description of 
biomarkers expected to associate with response for each of the six drugs 
tested by both gCSI and GDSC; recovery of expected biomarkers; total 
selected feature count; and overlap between gCSI and GDSC selected 
feature sets. b, Genomic features were ranked by direction and strength 
of their association with GDSC mean viability (from strongest negative 
to strongest positive). Shaded regions indicate features with significant 


univariate association (FDR-adjusted P < 0.10). Blue indicates negative 
association with response and red indicates a positive association. The 
univariate ranks of features selected by the elastic net using gCSI drug 
response are indicated with vertical bars, also colored by direction of 
association. In general, the gCSI-selected features are strongly enriched 
at the extreme ends of the distributions, indicating that they are highly 
relevant to both drug response data sets. c, As for panel b, but applying 
the elastic net to GDSC drug response data and comparing results to 
univariate statistics obtained from gCSI drug response. 
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Extended Data Figure 8 | Media serum concentration affects mean 
viability for some drugs and cell lines. Differences in mean viability 
between the 10% and 5% FBS conditions are shown for the follow-up 
screen. Only CTG-derived mean viability values were used. a, Changes 
in single-run mean viability for variable seeding data before synthesizing 
replicates. In many cases, direction of change was not consistent. In others 
(for example, 769P or CAL-12T when treated with PD0325901), richer 
media had a consistent effect in each replicate. For each drug, overall 
significance of bidirectional effect was assessed by running a paired 
t-test for each cell line separately, and then pooling results by Fisher's 
method. Overall media effect was significant for all drugs but lapatinib 
(for erlotinib, P= 0.037; lapatinib, P = 0.288; paclitaxel, P=0.014; and 
PD0325901, P=2.8 x 10~*). b, Only PD09325901 achieved an average 
media effect across cell lines that was statistically significant and of 
practically important size (4 percentage points). Data for fixed and 
variable seeding are plotted together because no significant difference 


in mean viability shift was observed between them for any of the four 
drugs considered (two-sided, unpaired t-tests: erlotinib, n = 47, P= 0.86; 
lapatinib, n= 45, P=0.29; paclitaxel, n = 48, P=0.80; and PD0325901, 
n= 46, P=0.44). Mean viability values were derived from biological 
triplicate experiments (Methods). c, SYTO 60 produces more variable 
viability measurements than CTG. The width of the 95% confidence 
intervals for mean viability the 24 cell lines in the follow-on screen 

(on viability scale; see Methods for confidence interval estimation). 
Symbol colour indicates seeding strategy; symbol shape indicates media 
condition. For all drugs and screening conditions, the vast majority of 

cell lines showed wider confidence intervals for SYTO 60 than for CTG, 
indicating increased noise in the SYTO 60 data and reduced precision in 
estimation of mean viability (mean increase for erlotinib was 7 percentage 
points, P=1.2 x 107"; for lapatinib, 8 points, P=1.6 x 10~"*; for paclitaxel, 
5 points, P=2.7 x 10~!°; and for PD0325901, 5 points, P=2.5 x 107’). This 
effect is not related to seeding strategy or media serum concentration. 
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Extended Data Figure 9 | Seeding density affects drug response for 
multiple drugs and cell lines. a, In the follow-up screen, the difference in 


estimated CTG mean viability between variable and fixed seeding is shown 
(with the two media conditions averaged). For broadly active compounds, 
slow growing lines typically yielded higher mean viability estimates under 
variable seeding and therefore produced a positive difference. Fast growing 


lines, on the other hand, tended to yield lower mean viability estimates 
under variable seeding (Spearman's p test, P= 1.71 x 107°). This trend 
was not significant for specifically active drugs (P= 8.77 x 10-7). Lines of 
average growth rate, for which fixed and variable seeding were essentially 
the same and thus no difference was expected, are indicated in the dashed 
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rectangle. b, Similar to panel a, but showing the difference between gCSI 
(variable seeding) and CCLE (fixed seeding). Because mean viability 
statistics are not directly comparable when different dosing schedules are 
used, residuals from a regression of gCSI mean viability on CCLE mean 
viability are plotted in the heat maps rather than a simple difference. The 
line plot shows a moving average (75 cell line window) of the row averages 
of each heat map. As for the follow-up screen, a significant association 
was seen between cell line growth rate and impact of seeding density 

for broadly active drugs (Spearman’s p test, P= 7.76 x 10~°), but not for 
specifically active drugs (P= 1.12 x 107'). Mean viability values were 
derived from biological triplicate experiments (Methods). 
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Extended Data Table 1 | Key differences in assay conditions between projects 


@ Differences in assay conditions 


Condition gCSl CCLE GDSC 
Viability readout CellTiter-Glo CellTiter-Glo SYTO 60 
Cell seeding Variable Fixed Variable 
Media 5% FBS 10% FBS 5% FBS 
Plate format 384 1,536 96 or 384 


b Cell lines available for mean viability comparisons 


gCSI vs. CCLE gCSI vs. GDSC 

specific bid1870 0) 252 
broad bortezomib 0 92 

specific crizotinib 285 102 
broad docetaxel 0 248 
broad doxorubicin 0 288 
specific erlotinib 281 83 

specific gdc0941 0 238 
broad gemcitabine 0 286 
broad irinotecan 166 0 

specific lapatinib 280 96 
specific ms275 0 91 

broad paclitaxel 271 100 
broad pd0325901 281 250 
specific rapamycin 0 96 
broad thapsigargin 0 289 
broad vorinostat 0 242 


a, For viability, g¢SI and CCLE used CellTiter-Glo, which measures metabolism by quantifying ATP levels; GDSC used the SYTO 60 nucleic acid stain, producing a signal roughly proportional to cell 
count. GDSC and gCSI varied the initial number of cells plated for each line in order to achieve similar confluency in control wells at the end of the assay; CCLE used a fixed number of cells for every 
line. GDSC and gCSI supplemented cell culture media with 5% FBS, while CCLE used 10% FBS. All factors except plate format were evaluated in our follow-up screen. Values in red distinguish one data 
set from the other two. b, Cell lines available for mean viability comparisons. The number of available GDSC mean viability or re-calculated CCLE mean viability values were tabulated for all drugs and 
cell lines that had matching gCSI mean viability data (see Extended Data Fig. 1). 
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The chemical modification of structurally complex fermentation products, a process known as semisynthesis, has been an 
important tool in the discovery and manufacture of antibiotics for the treatment of various infectious diseases. However, 
many of the therapeutics obtained in this way are no longer effective, because bacterial resistance to these compounds 
has developed. Here we present a practical, fully synthetic route to macrolide antibiotics by the convergent assembly of 
simple chemical building blocks, enabling the synthesis of diverse structures not accessible by traditional semisynthetic 
approaches. More than 300 new macrolide antibiotic candidates, as well as the clinical candidate solithromycin, have 
been synthesized using our convergent approach. Evaluation of these compounds against a panel of pathogenic bacteria 
revealed that the majority of these structures had antibiotic activity, some efficacious against strains resistant to 
macrolides in current use. The chemistry we describe here provides a platform for the discovery of new macrolide 
antibiotics and may also serve as the basis for their manufacture. 


Natural products have provided critical starting points for the develop- 
ment of a majority of the antibacterial drugs listed as essential medicines 
by the World Health Organization. During the period from about 1940 
to 1960, sometimes described as the golden age of antibiotics research, 
academic and industrial laboratories identified the natural products 
that went on to define many of the major classes of modern antibiotics!. 
Few if any natural products prove to be optimal for safety, efficacy or 
oral use in humans, however, for these were probably not evolutionary 
pressures for the microbes in which they developed. For six decades a 
primary tool by which new antibiotics have been discovered and manu- 
factured is semisynthesis, or the chemical modification of natural prod- 
ucts derived from fermentation. Semisynthesis is inherently limited 
because it is challenging to modify structurally complex natural prod- 
ucts selectively, and typically few positions of any given scaffold can 
be modified effectively”*. Macrolide antibiotics (macrocyclic lactones 
with one or more pendant glycosidic residues), which have proven 
to be safe and effective for use in treating human infectious diseases 
such as community-acquired bacterial pneumonia, gonorrhoea and 
others, provide a compelling case in point. Since the discovery of eryth- 
romycin from a Philippine soil sample in 1949*, in spite of extraor- 
dinary efforts leading to fully synthetic’-* and modified biosynthetic 
routes” | to macrolide antibiotics, all members of this class approved 
or in clinical development for use in humans have been manufactured 
by chemically modifying erythromycin (erythromycin itself is unstable 
in the stomach, and rearranges to form a product with gastrointestinal 
side-effects)'’. Thus, azithromycin is prepared from erythromycin 
in four steps!*, clarithromycin requires six steps'’, and the advanced 
clinical candidate solithromycin is currently produced from eryth- 
romycin by a 16-step linear sequence (Fig. 1)'®!”. New macrolides 
are badly needed, for resistance to approved macrolide antibiotics 
such as clarithromycin and azithromycin is now widespread in both 


hospitals and the community'* “°. Here we present a practical, fully 
synthetic platform for the preparation of macrolide antibiotics, provid- 
ing both a discovery engine for structurally novel antibiotic candidates 
that would be difficult or impossible to obtain from erythromycin, as 
well as a basis for their eventual manufacture. Using simple building 
blocks and a highly convergent assembly process, we have prepared 
more than 300 structurally diverse macrolide antibiotic candidates, 
as well as the approved drug telithromycin and the clinical candidate 
solithromycin. We have identified molecules with diverse macrocyclic 
scaffolds that exhibit potent activities against bacterial strains resistant 
to erythromycin, azithromycin and other current antibiotics of different 
classes. 


Synthesis of 14-membered azaketolides 

To illustrate our approach to the synthesis of macrolide antibiotics, we 
begin by detailing a route to new and highly active hybrid antibiotics, 
14-membered azaketolides (Fig. 2), and then show how the approach 
can be easily expanded to subsume many other active macrolide scaf- 
folds, with broad latitude for substitutional variation. Ketolides replace 
the C3-cladinose carbohydrate residue of erythromycin with a carbonyl 
group, a chemical modification that allows them to evade certain inducible 
forms of macrolide resistance*’*’, whereas the azalide azithromycin 
incorporates a nitrogen atom within an expanded (15-membered) 
macrolactone ring, features that have been proposed to impart favour- 
able pharmacological properties, including increased efficacy against 
certain Gram-negative pathogens***. The hybrid ‘azaketolides’, or 
macrolides that contain both a C3 carbonyl group and a nitrogen atom 
within the macrolactone ring, have been little explored; one such com- 
pound has been reported and was found to be essentially inactive”. We 
were particularly interested in 14-membered azaketolides that would 
arise from formal replacement of the carbonyl group at position C9 
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US FDA approval: 


Figure 1 | Summary of macrolide antibiotic development by 
semisynthesis. To date, all macrolide antibiotics are produced by 
chemical modification (semisynthesis) of erythromycin, a natural product 
produced on the ton scale by fermentation. Depicted are erythromycin 
and the approved semisynthetic macrolide antibiotics clarithromycin, 


of ketolides with an amino group, a transformation that would not be 
feasible by semisynthesis. 

An overview is helpful, as it emphasizes our synthetic strategy. Eight 
simple building blocks (1-8), two of which are commercially available 
and six of which are synthesized in sequences of 3-6 steps in 30-150-g 
amounts, are assembled in a highly convergent manner (Fig. 2). 
As depicted, building blocks 1-3 are transformed by a seven-step sequence 
featuring two convergent coupling reactions to provide a key left-hand 
intermediate (15, an amine) in 57% yield, while building blocks 4-7 
are transformed in a seven-step sequence featuring three convergent 
coupling reactions to form a key right-hand intermediate (22, a ketone) 
in 58% yield. (Here left and right refer to the position in the graphi- 
cal representation of the target molecule, Fig. 2a.) The left- and right- 
hand intermediates are then joined in a highly stereoselective and 
efficient reductive coupling reaction (82% yield, diastereomeric ratio (d.r.) 
>20:1) that also serves to remove the benzoate protective group within 
the desosamine residue. The next step is the key macrocyclization reac- 
tion, for which we employ a method developed by Boeckman et al. in 
the context of their approaches towards the natural products diplodi- 
alide A (a 10-membered macrolactone), kromycin (a 14-membered 
macrolactone) and ikarugamycin (a 17-membered macrolactam)”°”. 
We found that thermolysis of intermediate 23 (1 mM in chlorobenzene) 
at 132°C afforded the macrolactone 24 in 78% yield (1-g scale). There 
are a number of notable features of this transformation, not least its 
efficacy and, as we will show, generality. Many pioneering efforts to 
synthesize macrolide antibiotics failed at the stage of macrocyclization; 
successful macrocyclizations were shown to require carefully designed, 
rigidifying protective group strategies”®. We believe that the success of 
the present cyclization (which we note occurs in the presence of a free 
secondary amine as well as another secondary alcohol) is attributable 
in part to the rigid structuring induced by the cyclic carbamate func- 
tion on the left-hand side of the molecule, which we suspect brings the 
putative acyl ketene function (produced from the dioxolenone function 
upon thermolysis) and the reactive secondary alcohol into favourable 
proximity. A single operation then transforms the product of macro- 
cyclization into a fully synthetic antibiotic candidate (25) while incor- 
porating the final building block using Sharpless’ optimized conditions 
for azide-alkyne dipolar cycloaddition”? (8, 86% yield). This final step 


Clarithromycin 
6 steps from erythromycin 


Cethromycin 
9 steps from erythromycin 
Clinical candidate 


Azithromycin 
4 steps from erythromycin 


HsC F 


Solithromycin 
16 steps from erythromycin 
Clinical candidate 


azithromycin and telithromycin along with the dates of their FDA approval 
and the number of steps for their synthesis from erythromycin. The 
previous ketolide clinical candidate cethromycin and the current clinical 
candidate solithromycin are also depicted. It is evident that increasingly 
lengthy sequences are being employed in macrolide discovery efforts. 


is by design a convergent coupling reaction, and permits optimal late- 
stage diversification of a key side-chain residue that is known to occupy 
a novel pocket in the bacterial ribosome*?™*. 

Aspects of certain steps before the macrocyclization reaction merit brief 
discussion (see Fig. 2). The key aldol coupling reaction that unites building 
blocks 1°? and 2°°, initiating the synthesis of the left-half intermediate, 
involves new methodology that was developed specifically for the present 
purpose (though it proves to be a general transformation) and proceeds 
in 98% yield on a 33-g scale to form a single, crystalline diastereomer”®. 
In a subsequent step, which transforms the pseudoephenamine amide 
function into a methyl ketone in one operation, the pseudoephenamine 
auxiliary is recovered by a simple aqueous extraction procedure in 92% 
yield. The convergent coupling reaction that unites building blocks 
4 and 5 to begin the construction of the right-half intermediate is 
based on a palladium-catalysed ene reaction developed by Mikami and 
co-workers, and proceeds in 93% yield and 92% enantiomeric excess 
(e.e.)°”. Two convergent coupling reactions follow three linear functional 
group interconversions, the first coupling being a Mukaiyama’® aldol 
reaction (to incorporate building block 6°”) whose diastereoselectivity 
was observed to be dependent upon the reaction conditions. The condi- 
tions employed here (magnesium bromide etherate as Lewis acid, trieth- 
ylamine as base, dichloromethane as solvent, and a reaction temperature 
of —78°C) provided maximum selectivity (d.r. >20:1, 91% yield) for 
the desired syn-aldol product. The next convergent coupling reaction 
introduced the protected desosamine carbohydrate residue and for this 
purpose we employed the thioglycoside 7*!, a donor very similar to 
that employed by Woodward et al. in their synthesis of erythromycin’. 
In the present case, we found that the use of a benzoyl group to shield the 
secondary alcohol on the glycosidic donor in lieu of methoxycarbonyl 
improved the yield and diastereoselectivity of the glycosidation reaction. 
The overall yield of the fully synthetic azaketolide antibiotic candidate 25 
was 32% from initial building blocks 1 and 2 (a 10-step sequence) or 32% 
from initial building blocks 4 and 5 (also a 10-step sequence). 


Synthesis of 15-membered azaketolides 

By design, the convergent building block strategy we employ enables 
rapid access to structurally related scaffolds; for example, synthesis 
of a 15-membered hybrid azaketolide scaffold comprising structural 
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Figure 2 | A convergent, fully synthetic route to the 14-membered 
azaketolide 25. a, Graphical representation of the convergent synthesis 

of azaketolide 25 from eight variable building blocks (represented by 
coloured spheres). Downward, “Y’-shaped arrows signify convergent 
coupling reactions. b, Synthesis of azaketolide 25, reagents and conditions 
(subscripts L and R indicate left and right halves, respectively) as follows: 
(ay) LiHMDS, LiCl, 98%; (by) Et'Pr2N, COCL,; (cz) ‘PrMgCl, CH3Li, 


features of both the azalide antibiotic azithromycin and the ketolide clin- 
ical candidate solithromycin (Fig. 3) required modification of just three 
of the eight building blocks that were used to assemble 14-membered 
azaketolides (Fig. 2). The left-hand amine intermediate (15) was the 
same in both 14- and 15-membered azaketolide scaffold syntheses and 
thus construction of the 15-membered azaketolides simply required 
access to the homologated right-hand intermediate, aldehyde 35. 
Here, too, we employed a seven-step sequence (38% yield) featuring 
three convergent coupling reactions to assemble four simple building 
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76% over 2 steps, 92% of recovered (R,R)-pseudoephenamine; 

(d,) NaHMDS; (ey) NaN3, 88% over 2 steps; (f,.) NH3, Ti(O’Pr)4, NaBH, 
95%; (g_) BusNE, 92%; (apg) Pd[(S)-SegPhos]Cl, AgSbF¢, 93%, 92% e.e.; 
(bp) ethylene glycol, PPTS, 92%; (cr) KH, Mel, 97%; (dp) (‘Bu)2AlH, 96%; 
(ep) Et3N, MgBr2eOEt, 91%; (fp) AgOTf, 81%, 16:1 8:05 (gr) HCl, 100%; 
(h) NaCNBHs3, 82%; (i) 132°C, 1mM, PhCl, 78%; (j) CuSO4, sodium 
L-ascorbate, 86%. 


blocks. The first coupling, which unites building blocks 26** and 
27**-** to form the tertiary alcohol 29, emerged only after extensive 
experimentation, in which we discovered that incorporating the large 
tert-butyldiphenylsilyl protective group within component 27 was key 
to achieve high diastereoselectivity. After two subsequent functional 
group interconversions, building blocks 6 (as before) and 28 (the orig- 
inal glycosyl donor of Woodward et al.°, which proved to be effective 
in this scaffold synthesis) were incorporated by successive convergent 
coupling reactions. Two linear steps then provided the right-hand 
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Figure 3 | A convergent, fully synthetic route to the 15-membered 
azaketolide 38. a, Graphical representation of the convergent synthesis 

of azaketolide 38 from eight variable building blocks (represented by 
coloured spheres). Downward, “‘Y’-shaped arrows signify convergent 
coupling reactions. b, Synthesis of azaketolide 38, reagents and conditions 
(subscripts L and R indicate left and right halves, respectively): 

(a,) LiHMDS, LiCl, 98%; (by) Et'Pr2N, COC; (cz) ‘PrMgCl, CH3Li, 


intermediate, aldehyde 35. Reductive coupling of the left-hand amine 
(15) and the right-hand aldehyde (35) in the presence of sodium 
cyanoborohydride proceeded without detectable epimerization to 
afford the macrocyclization precursor 36 in 86% yield. Thermolysis 
of 36 at 132°C in chlorobenzene then afforded the 15-membered 
macrolide 37 in 94% yield on a 1.9-g scale. Incorporation of the final 
building block (8, as before) and cleavage of the methoxycarbonyl 
protective group were achieved in a single, final step to provide the 
azaketolide antibiotic candidate 38. The overall yield of the fully syn- 
thetic macrolide 38 (FSM-20707, see below) was 43% from initial 
building blocks 1 and 2 (a 10-step linear sequence) or 33% from initial 
building blocks 26 and 27 (also a 10-step linear sequence). 


Synthesis of 14-membered ketolides 
We next adapted our convergent assembly strategy to synthesize 
14-membered ketolides, a class that includes the clinical candidate 
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(gr) BugNE, 92%; (ar) ‘Bila, MgBro, 81%; (br) KH, Mel, 99%; (cr) HsIO¢, 
99%; (dr) ZnCly, 91%; (er) AZOTE, 70%; (fg) HF (aq.); (gx) Dess—Martin 
periodinane, 87% over 2 steps; (h) NaCNBH3, 86%; (i) 132°C, 1mM, PhCl, 
94%; (j) CuSO., sodium L-ascorbate, 93%. 


solithromycin as well as the approved drug telithromycin, with the goal 
of preparing diverse structural analogues. Towards this end we envi- 
sioned coupling of the same right-hand intermediate employed in the 
synthesis of 15-membered azaketolides (aldehyde 35) with a Grignard 
reagent derived from hydromagnesiation of the acetylenic alcohol 42 
(Fig. 4). Intermediate 42 was obtained in one step (14:1 d.r., 85% yield) 
by the addition of 1-lithiopropyne (39) to ketone 40*° in the presence 
of lithium (1S,2R)-1-phenyl-2-(pyrrolidin-1-yl)-1-propanolate*’. 
Hydromagnesiation of 42 with cyclopentylmagnesium bromide and 
bis-cyclopentadienyltitanium dichloride*® proceeded regioselectively to 
furnish a vinyl Grignard reagent that was captured in situ with aldehyde 35. 
The resulting diastereomeric mixture of allylic alcohols (obtained in 
80% yield on a 3-g scale) was oxidized directly (Dess—Martin periodi- 
nane, 97%) and the a,8-unsaturated ketone produced was desilylated 
(tetrabutylammonium fluoride (Bu,NF), 95%) to afford the keto alcohol 
45. Macrocyclization, as before, then provided the 14-membered 
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Figure 4 | A convergent, fully synthetic route to solithromycin. 

a, Graphical representation of the convergent synthesis of solithromycin, 
which was previously only accessible by semisynthesis. This route has been 
adapted for the synthesis of >30 novel ketolide antibiotic candidates, as 
well as the FDA-approved ketolide telithromycin. Downward, “Y’-shaped 
arrows signify convergent coupling reactions. b, Synthesis of solithromycin, 
reagents and conditions (subscripts L and R indicate left and right portions, 


macrolactone 46 in 66% yield (1.7-g scale). Fluorination at position C2 
proceeded in 85% yield*”. Transformation of the C12 tertiary alcohol 
within the latter intermediate (47) to the corresponding acyl imida- 
zolide (carbonyl diimidazole (Im2CO), 1,8-diazabicyclo[5.4.0]undec- 
7-ene (DBU)) and trapping with amine 49 then afforded fully synthetic 
solithromycin in one operation (87% yield, the methoxycarbonyl 
protective group was cleaved concomitantly under these conditions). 


342 | NATURE | VOL 533 | 19 MAY 2016 


h 
80% 


Modular building blocks: 8 
Convergent coupling reactions: 7 
From 26 and 27: 14 steps, 16% yield 


OH 


(CH3) 
oO fe) CHs 
45 
66% 
(CH3) | (CH3) 
Oo CHs : (e) CHs 
85% 
47 46 


respectively): (az) lithium (1S,2R)-1-phenyl-2-(pyrrolidin-1-yl)-1-propanolate, 
85%; (br) CuSO, sodium L-ascorbate, 95%; (ar) ‘BuLi, MgBr2, 81%; 

(ba) KH, Mel, 99%; (cr) Hs1O¢, 99%; (dr) ZnClo, 91%; (er) AZOTE, 70%; 
(fg) HF (aq.), 95%; (gx) Dess—Martin periodinane, 92%; (h) Cp2TiCh, 
cyclopentylmagnesium bromide, 80%; (i) Dess-Martin periodinane, 97%; 
(j) BusNE, 95%; (k) 132°C, 0.5 mM, PhCl, 66%; (1) KO’Bu, FN(SO2Ph)», 
85%; (m) Im2,CO, DBU; (n) imidazole hydrochloride, 60°C, 87% over 2 steps. 


The present route to solithromycin proceeds in 14 steps and 16% yield 
from building blocks 26 and 27, and differs from the semisynthetic 
route (16 steps from erythromycin, yields not reported’®) in that the 
linker and its attached side-chain heterocycle are introduced in the final 
step of our synthesis and thus modification of these residues is quite 
facile, whereas the linker residue is introduced 6 steps before the penul- 
timate step in the published semisynthetic route to solithromycin!® 
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Figure 5 | Structures of selected fully synthetic macrolide antibiotics, 
and their MICs compared to those of natural and semisynthetic 
macrolide antibiotics. Top three rows, structural formulae. Bottom row, 
table showing MICs (values in xg m1“! colour coded, key at bottom) 

of current macrolide antibiotics and of selected analogues against 

9 Gram-positive and 5 Gram-negative microorganisms. iErmA, inducible 


We prepared a number of fully synthetic 14-membered ketolides by 
taking advantage of this feature of our synthesis to simultaneously vary 
the linker and side-chain heterocycle, and many of these products show 
potencies that are superior to solithromycin in microbiological assays 
(see below). 


Construction of a library of macrolide antibiotics 

We prepared an initial library of >300 fully synthetic macrolide (FSM) 
antibiotic candidates by varying the building blocks in concert with 
modifying readily diversifiable elements (for example, an azido group, 
an amino group, a B-keto lactone function, an allyl group) that we 
introduced into and around the macrolide ring, a powerful tactical 
combination. In addition to members of the three primary macrocy- 
clic scaffolds discussed in detail above, we prepared a number of other 
unique scaffolds by modifying the principal coupling components 
(left- and right-halves) and their modes of coupling using straight- 
forward alternative chemical transformations (see Supplementary 
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erythromycin ribosome methyltransferase A; cErmA, constitutive 
erythromycin ribosome methyltransferase A; MsrA, macrolide 
streptogramin resistance efflux pump A; MefA, macrolide efflux 
protein A; ErmB, erythromycin ribosome methyltransferase B; Erythro, 
erythromycin; Azithro, azithromycin; Telithro, telithromycin; Solithro, 
solithromycin. 


Information for a complete list of structures synthesized and Extended 
Data Figs 1-10 for exemplary schemes for their preparation). This strat- 
egy not only provided novel scaffolds to explore, but also permitted 
deep-seated variations of positions within these scaffolds, thus enabling 
access to molecules that could not be prepared using semisynthetic 
methods. 


Microbiological testing 

To evaluate our FSMs for antibiotic activity, we screened 305 of them 
against a panel of pathogens comprising standardized Gram-positive 
and Gram-negative strains (see Supplementary Information). For 
the most promising antibiotic candidates, the panel was expanded 
to include bacteria with genetically characterized resistance mech- 
anisms to erythromycin as well as other antibiotics. These include 
three different strains of Staphylococcus aureus, two of them clinically 
isolated methicillin-resistant (methicillin-resistant Staphylococcus 
aureus, MRSA) pathogens, a clinically isolated strain of Streptococcus 
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pneumoniae with both ribosome-modifying methyltransferase (ermB) 
and efflux (mefA) genes, and finally a clinically isolated strain of 
Enterococcus faecalis with an ermB gene, a strain resistant to all approved 
macrolides as well as vancomycin (vancomycin-resistant Enterococcus, 
VRE) (Fig. 5). The data show that a majority of compounds in the 
library exhibit demonstrable antibiotic activity. For example, 83% 
of compounds in the collection displayed a minimum inhibitory 
concentration (MIC) <4y.g ml against wild-type S. pneumoniae, 
which is known to be highly susceptible to inhibition by macrolides 
(see Supplementary Information). Among the most promising com- 
pounds (Fig. 5) are represented variously 14-, 15- and 16-membered 
azaketolide scaffolds (for example, FSM-22391, FSM-20707 and FSM- 
21397, respectively), a 15-membered azacethromycin hybrid anti- 
biotic (FSM-20919), and a number of fully synthetic 14-membered 
ketolides (most notably, FSM-100573 and FSM-100563). The last 
two substances display superior potencies relative to any macrolide 
in current clinical use in the following extremely challenging strains: 
an S. pneumoniae with both ermB and mefA genes (MIC for both FSM- 
100573 and FSM-100563 <0.03 1g ml —'), a VRE with an ermB gene 
(MICs 1 and 2\:g ml‘, respectively), an MRSA with a constitutively- 
expressed erythromycin ribosome methyltransferase (c-erm) gene 
(MICs 161g ml~'), and Pseudomonas aeruginosa (MICs 16 and 
32\1g ml’, respectively). Clearly, further work will be required to drive 
potencies in the latter two strains to clinically efficacious levels, but we 
believe that demonstration of even modest activity in these challenging 
strains by a macrolide is significant. It is interesting and encouraging 
to note that FSM-100573 also displays improved Gram-negative activ- 
ity relative to many other macrolides; further advances in this regard 
would address an important area of unmet medical need®. 


Conclusion 

Employing as a design strategy the multiply convergent assembly 
of simple chemical building blocks, we have developed a platform 
of unprecedented versatility for the discovery and practical synthesis of 
novel macrolide antibiotics. Varying the building blocks as well as (at a 
later stage) diversifiable elements incorporated within them permits an 
almost exponential expansion of variability within any given scaffold. 
In addition, our work shows that variant scaffolds can be obtained by 
straightforward perturbations of our generalized assembly process. 
As with our earlier convergent synthesis of tetracycline antibiotics”, 
we anticipate that many thousands of novel macrolide structures can 
be prepared for evaluation as potential antibiotics using the present 
synthetic platform. In light of this, it seems logical to conclude that 
developing similar convergent routes to other naturally occurring anti- 
biotic families may accelerate the discovery of new therapeutic agents 
for human infectious diseases. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Extended Data Figure 1 | Synthesis of a C2-fluoro 14-membered azaketolide by a late-stage fluorination reaction. Subjection of 3-keto lactone 25 
(FSM-22367) to potassium tert-butoxide (1.0 equiv.) at —78 °C followed by N-fluorobenzenesulfonimide (1.0 equiv.) afforded 50 (FSM-22391) in 43% yield. 
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Extended Data Figure 2 | Synthesis of a 15-membered azaketolide a nearly ideal intermediate for preparation of macrolides with diverse 
with a modified C2-substituent. Thermolysis of a 3-keto tert-butyl ester C2-substitutions. For example, an allyl group was introduced at C2 by 
substrate (55) proceeded at a lower temperature (80 °C) than that for treatment of 56 with sodium tert-butoxide (1.1 equiv.) and allyl iodide 
dioxolenone substrates (132 °C) and afforded a 15-membered macrocycle (1.1 equiv.) at —40°C followed by warming the reaction solution to 23°C. 
without substitution at C2 (56). (Here and elsewhere we omit mention The product 57 (obtained in 62% yield) was then transformed to 59 
of the intermediate compounds for clarity.) This macrocycle served as (FSM-56156) in two steps (via 58; 72% yield). 
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65 (FSM-20919) 

2:1 mixture of epimers at C2 
Extended Data Figure 3 | Synthesis of a 15-membered azacethromycin 
hybrid. Macrolactone 63 was prepared from amine 60 and aldehyde 
61 in two steps (via 62) by a reductive amination-macrocyclization 
sequence. Treatment of 63 with paraformaldehyde (6.0 equiv.) and acetic 
acid (10.0 equiv.) furnished adduct 64 as a crystalline solid (84% yield; 


130 °C 
— 
68% 
63 
3:1 mixture of epimers at C2 
(CH20),, 5 
AcOH 84% 
64 
X-ray structure single epimer at C2 


X-ray structure shown). The imidazolidine group within 64 served to 
protect both the secondary amine and the cyclic carbamate functions, and 
permitted the introduction of a quinoline heterocycle via a Heck reaction. 
Methanolysis (TFA, CH30H) cleaved the imidazolidine group, affording 
65 (FSM-20919; 29%, two-step yield). 
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Extended Data Figure 4 | Synthesis of a 15-membered azaketolide with 
a modified C10-substituent. N-tert-butylsulfinyl imine 68 (prepared in 
five steps via 66 and 67 from amide 10) allowed for the stereocontrolled 
introduction of various C10-substituents. For example, addition of 
allylmagnesium bromide proceeded with >20:1 stereoselectively to 
establish the stereocentre at C10; subsequent cleavage of the sulfinyl 
(HCl, CH3;0H) and tert-butyldiphenylsilyl (Bu,NF) groups within the 
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adduct then furnished left-hand intermediate 69 (81% yield). Amine 

69 and aldehyde 35 were coupled by a reductive amination reaction 
(NaBH3CN, 60%-75% yield). The product (70) was then transformed to 73 
(FSM-11044) in a three-step sequence that consisted of a macrocyclization 
reaction (giving 71; 72% yield), a methanolysis reaction (giving 72 in 
quantitative yield) and lastly a [3 + 2] dipolar cycloaddition reaction 
(giving 73 in 92% yield). 
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Extended Data Figure 5 | Synthesis of a 15-membered azaketolide with 
a modified C13-substituent. Modification of position C13 was achieved 
by modification of a single component, in this case the ketone building 
block 74 depicted above. Reductive coupling of 78 and 35 united the 

left- and right-halves to give 79; subsequent thermal macrocyclization 


NH 
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provided macrolactone 80. The allyl group within intermediate 80 was 
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cleaved upon ozonolysis (O3, trifluoroacetic acid); reductive workup 
with sodium cyanoborohydride afforded alcohol 81. Subjection of 81 to 
bis(2-methoxyethyl)aminosulfur trifluoride afforded the fluoroethyl- 
substituted macrocycle 82 (30%, two-step yield), which was transformed 
to 83 (FSM-11453) by a [3 + 2] dipolar cycloaddition reaction. 
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Extended Data Figure 6 | Synthesis of a 15-membered azaketolide 
with a modified desosamine sugar residue. The 15-membered 
macrolactone 90 was synthesized using thioglycoside 84 and alkyne 
89 as building blocks (in lieu of building blocks 28 and 8 used for the 
synthesis of 15-membered azaketolide FSM-20707). Treatment of 

90 with tributyltin hydride (2.0 equiv.), acetic acid (5.0 equiv.), and 
tetrakis(triphenylphosphine)palladium (2 mol%) led to cleavage of the 


CH, 


91 
4:1 mixture of epimers at C2 


5:1 mixture of epimers at C2 


allyloxycarbonyl protective group, giving rise to amine 91 (92% yield). 
The latter intermediate has been transformed into a number of fully 
synthetic macrolides with modified desosamine sugar residues. For 
example, acylation of the primary amino group of intermediate 

91 with pyridine-2-carbonyl chloride (2.0 equiv.) in the presence 

of trimethylamine (3.0 equiv.) followed by removal of the 
tert-butoxycarbonyl group afforded 92 (FSM-21887, 86%, two-step yield). 
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Extended Data Figure 7 | Synthesis of a 16-membered azaketolide. of amine 15 and aldehyde 93 furnished macrocyclization precursor 94 
Homologation of aldehyde 35 was achieved by a Wittig olefination (73% yield). The 16-membered macrolactone 95 was obtained in 78% yield 


reaction (CH;0CH3PPh3* Cl-, NaHMDS) followed by hydrolysis of the upon thermolysis of 94 (1 mM, 132°C). Two additional steps transformed 
resulting enol ether to afford aldehyde 93 in 65% yield. Reductive coupling 95 via 96 to the 16-membered azaketolide 97 (FSM-21397). 
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Extended Data Figure 8 | Synthesis of a 14-membered macrolide with 

a trans-olefin linkage. Mesylate 98 was prepared in quantitative yield by 
treatment of alcohol 34 with methanesulfonyl chloride (1.50 equiv.) and 
triethylamine (2.0 equiv.). Displacement of the mesylate group in 98 with 
sodium 1-phenyl-1H-tetrazole-5-thiolate (2.0 equiv.) followed by oxidation 
of the resulting thioether with ammonium molybdate (0.20 equiv.)-hydrogen 
peroxide (100 equiv.) afforded sulfone 100 in 70% yield. Aldehyde 101 
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and sulfone 100 were coupled in a Julia—Kocienski olefination reaction 
(NaHMDS, —78 — 23 °C) to provide a 4.8:1 mixture of E- and Z-olefin 
isomers. The E-isomer 102 was isolated and desilylated (Bu,NF, 79%). 
Thermolysis of the product 103 (1 mM, 132°C) furnished the 
14-membered macrocycle 104 in 83% yield. 104 was then transformed 
to 106 (FSM-21079) in two additional steps (methanolysis and [3 + 2] 
dipolar cycloaddition) via 105. 
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Extended Data Figure 9 | Synthesis of a 15-membered macrolide with 
an amide linkage (C9-N9a). Oxidation of aldehyde 35 with sodium 
chlorite (10.0 equiv.) in the presence of sodium dihydrogen phosphate 
(10.0 equiv.) and 2-methyl-2-butene (100 equiv.) afforded carboxylic acid 
107 in 70% yield. Acid 107 and amine 15 were coupled in the presence 
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of 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide (2.00 equiv.) to 
provide amide 108. Macrocyclization of 108 (1 mM, 132°C) proceeded 
in 81% yield to afford macrolactam 109. Methanolysis and [3 + 2] dipolar 
cycloaddition then transformed 109 to 111 (FSM-21344) in two steps 

via 110. 
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Extended Data Figure 10 | Synthesis of a 15-membered macrolide with 
an amide linkage (C10-N9a). Amine 112 was prepared in 70% yield by 
displacement of the mesylate group in 98 with sodium azide followed 

by reduction of the resulting alkyl azide (H2, Pd). The coupling of amine 
112 and acid 113 proceeded in 54% yield. The product, amide 114, (FSM-21473) in 64% 
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was desilylated (Bu,NE, 80%) to afford the macrocyclization precursor 
115. Thermal macrocyclization of 115 followed by cleavage of the 
methoxycarbonyl protective group afforded lactam 116 in 80% yield. 
Copper-catalysed [3 + 2] dipolar cycloaddition then provided 117 
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Structure of the T4 baseplate and its 
function in triggering sheath contraction 


Nicholas M. I. Taylor!, Nikolai S. Prokhorov!, Ricardo C. Guerrero-Ferreira', Mikhail M. Shneider!*, Christopher Browning't, 


Kenneth N. Goldie*, Henning Stahlberg* & Petr G. Leiman! 


Several systems, including contractile tail bacteriophages, the type VI secretion system and R-type pyocins, use a 
multiprotein tubular apparatus to attach to and penetrate host cell membranes. This macromolecular machine resembles 
a stretched, coiled spring (or sheath) wound around a rigid tube with a spike-shaped protein at its tip. A baseplate 
structure, which is arguably the most complex part of this assembly, relays the contraction signal to the sheath. Here we 
present the atomic structure of the approximately 6-megadalton bacteriophage T4 baseplate in its pre- and post-host 
attachment states and explain the events that lead to sheath contraction in atomic detail. We establish the identity and 
function of a minimal set of components that is conserved in all contractile injection systems and show that the triggering 


mechanism is universally conserved. 


Bacteriophages (viruses of bacteria) use a specialized organelle called 
a tail to deliver their genetic material and proteins across the cell enve- 
lope during infection’. In phages with the most complex contractile 
tails, attachment to the host cell is accompanied by a substantial trans- 
formation of the tail structure: the external tail sheath contracts and 
drives a spike-tipped”, rigid tube through the host cell membrane. 
Other macromolecular complexes, such as the type VI secretion sys- 
tem (T6SS)’~!', metamorphosis-associated contractile (MAC) arrays’, 
R-type pyocins!*"*, Serratia antifeeding prophage!, Photorhabdus 
virulence cassette’® and rhapidosomes’”’, use a similar contractile 
sheath-rigid tube mechanism to breach the bacterial or eukaryotic 
cell envelope. However, the most complex part of these ‘contractile 
injection systems’ ’—the baseplate, which is responsible for coordi- 
nating host recognition or other environmental signals with sheath 
contraction—remains poorly characterized, and even its exact protein 
composition remains unknown for otherwise well studied systems such 
as bacteriophage T4 (refs 1, 8). 

The T4 baseplate is currently thought to contain at least 15 different 
proteins with copy numbers ranging from 1 to 18 (ref. 19). Assembly 
of the T4 baseplate involves two large independent intermediates: 
a hub and a wedge”. Several phage and possibly host cell chaperones 
mediate the joining of six wedges to the hub, which is then followed by 
the attachment of receptor-binding fibres to this structure. The nascent 
baseplate initiates tube assembly and subsequent polymerization of the 
sheath in the extended high-energy state*>”*. 

The remarkable structure and transformation of the T4 tail and 
other contractile injection systems have received considerable atten- 
tion? >'°!2.1525-32, However, many questions remain unanswered, such 
as how the baseplates are organized at the molecular level, how they 
circularize, how the tail fibre network communicates with the central 
part of the baseplate, and how the signal of host cell surface binding 
leads to the conformational change that releases the tube and induces 
sheath contraction. To answer these questions, we used cryo-electron 
microscopy (cryo-EM) to determine the structure of the T4 baseplate- 
tail tube complex at a resolution of 3.8 A for the universally conserved 
central part and tail tube and 4.1 A overall, and the structure of the 


post-attachment baseplate at a resolution of 6.8 A. We have built and 
refined the atomic model of the baseplate in its pre- and post-host 
attachment conformations, allowing us to visualize the transformation 
of this 440,000-atom (not counting hydrogens) structure. We have ana- 
lysed the signal transduction network of the baseplate by mutagenesis 
and examined the baseplate assembly pathway in the T6SS. 


Overall structure of the T4 baseplate 

We produced the pre-host attachment (also known as ‘hexagonal’?) 
baseplate-tail tube complex in vivo by stopping phage assembly at the 
stage of sheath polymerization*® (Extended Data Fig. 1). The complex 
switched to a post-attachment (also known as ‘star-shaped’*?) con- 
formation upon interaction with the cell membrane and sometimes 
spontaneously; at this point it lost the tube and central spike complex 
(Extended Data Fig. 1). 

The large size of the T4 baseplate (the diameter is 490 A, about 
twice that of the ribosome) made particle alignment very accurate 
but contributed to a substantial difference in resolution at the centre 
and edges of the reconstructed volume (Extended Data Fig. 2). We 
used local mask-based particle alignment***? to improve the electron 
density around the periphery of the baseplate (Extended Data Fig. 2). 
Furthermore, because this region is made of mostly trimeric pro- 
teins, we were able to improve the maps by additional local three-fold 
averaging. 

The final atomic model is around 96% complete and contains 56,082 
amino acid residues (9,886 unique) (Supplementary Tables 1 and 2). 
These amino acids belong to 145 polypeptide chains of 15 different 
proteins (gene products, gps) that comprise the baseplate (gp5, gp5.4, 
gp6, gp7, gp8, gp9, gp 10, gp11, gp12, gp25, gp27 and gp53) and the 
proximal region of the tail tube (gp19, gp48 and gp54, although gp48 
and gp54 could also be considered to be baseplate components’; Fig. 1). 
The tape measure protein gp29, which is located inside the tube and 
is thought to span its entire length’, is not resolved in our cryo-EM 
map. Gp5, gp5.4 and gp27 form the central spike complex that creates 
an extension of the tube. The rest of the baseplate can be divided into 
inner (gp6, gp25, gp53 and part of gp7), intermediate (gp8 and the 
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Figure 1 | Maps and atomic models of T4 baseplate in pre- and post- 
attachment states. a, b, Cryo-EM reconstructions of the pre- and post- 
attachment T4 baseplate, respectively. c-f, Atomic models of the two states 


rest of gp7), and peripheral (gp9, gp10, gp11 and gp12) parts with the 
peripheral part constituting the tail fibre network (Fig. 1). 


The inner baseplate 

Twelve copies of gp6 form a continuous ring that makes up most of 
the inner baseplate (Extended Data Fig. 3). The asymmetric unit of 
the ring contains two gp6 molecules that are chemically identical but 
have markedly different conformations. One molecule (gp6A) lies 
closer to the six-fold axis whereas the other (gp6B) is further away 
(Extended Data Fig. 3a, b). Each gp6 molecule in the ring interacts 
with its two neighbours, forming an N-terminal dimer with one and a 
C-terminal dimer with the other (Extended Data Fig. 3c, d). An earlier 
study proposed a similar organization of the gp6 ring”®, and the crystal 
structure reported in that study matches the gp6 C-terminal dimer in 
the complete baseplate (Extended Data Fig. 3d). However, the inter- 
action of the gp6 N-terminal regions is more complex than presumed 
earlier. 


ARTICLE 


i Inner baseplate 

G1 Intermediate baseplate 
BB Tail fibre network 

Bi Central spike 

Di Tail tube 


p7 [J 9p8 


with component proteins shown as ribbon diagrams. In d and f, the STFs 
(gp12 trimers) are displayed semi-transparently to indicate that they are 
not present in the refined model of the post-attachment baseplate. 


The N-terminal half of gp6 is part of a (gp6).-gp7 heterotrimeric 
module that consists of an a-helical ‘core bundle’ and a flat ‘trifurcation 
unit’ (Fig. 2). Gp25 and gp53 connect the core bundle to the central 
hub and the tube (Figs 2a, 3a), whereas the trifurcation unit allows the 
three polypeptide chains comprising it to reach different destinations 
(Fig. 2d). Two gp6 molecules extend in opposite directions to form 
C-terminal dimers that connect the trifurcation units into an iris-like 
structure (Fig. 2d). Gp7 follows a path towards the periphery of the 
baseplate, where it interacts with the tail fibre network proteins gp9 and 
gp10. The (gp6)2-gp7 module plays a critical role in the assembly and 
function of the baseplate because it mediates both the circularization 
of the baseplate and the communication of the tail fibre network with 
the baseplate hub. 

The (gp6).-gp7 heterotrimeric module contains large insertions that 
do not distort the gp6-gp7 iris (Fig. 2d). A wing-shaped domain (resi- 
dues 101-315) is inserted between the core bundle and the trifurcation 
unit in gp6 (Fig. 2b). The wing domain of gp6A tightly cradles the hub, 
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Figure 2 | The (gp6)2-gp7 heterotrimeric unit. a, Composition of 

the largest assembly intermediate of the baseplate (the wedge) and its 
location in the cryo-EM map. Gp12 and gp9 are not part of the wedge and 
are shown as visual clues to help with interpretation. The arrow points 

to the (gp6)2-gp7 heterotrimeric unit. b, The structure of the (gp6)2- 


whereas that of gp6B forms a segment of the platform for assembly of 
the first layer of the sheath (Extended Data Fig. 3c). In gp7, an inser- 
tion domain (residues 761-862) is part of the trifurcation unit. It is 
positioned close to the three-fold axis of the unit on the side opposite 
to the core bundle (Fig. 2b). 

Gp25 and gp53 form extensive interactions with the core bundle that 
stabilize the entire assembly. Gp25 is firmly attached to the tip of the 
bundle (the buried surface area is 1,525 A”) with its residues 1-28 cap- 
ping the bundle (Fig. 3b). Gp53 consists of two domains, one of which 
(residues 1-103 and 155-193) clamps the core bundle and encircles it 
for about three-quarters of a turn (the buried surface area is 3,179 A”) 
while the other extends the sheath assembly platform formed by the 
gp6B wing domain. 

Both gp25 and gp53 contain conserved motifs that interact with 
each other and with other components of the core bundle. The most 
conserved region of gp53 and its orthologues is a LysM motif?’ (resi- 
dues 46-82; Fig. 3b). The tip of a structurally conserved loop of gp25 
(residues 47-49) mediates the interaction between LysM and the core 
bundle. Orthologues of gp25 across many contractile ejection sys- 
tems contain an EPR motif (Glu-Pro-Arg, residues 85-87 of gp25)°**. 
This motif interacts with the core bundle (Fig. 3b) and points towards 
the region of the gp27-gp48 interface, which is not resolved in our 
cryo-EM map. The remarkable degree of conservation of this motif and 
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gp7 heterotrimer. Regions with lower conservation are shown semi- 
transparently. c, A close-up view of the (gp6)2-gp7 core bundle. 

d, The iris created by the (gp6)2-gp7 trifurcation domains and gp6 
C-terminal dimers. e, A slice of the structure shown in c demonstrates 
three-fold symmetry. 


its location at the baseplate-tail tube interface suggest that it might per- 
form an additional, possibly enzymatic, function, such as maturational 
proteolysis, which is widespread in virus assembly”’. 


The central spike and the tube 

The density of the trimeric part of the (gp5.4)|-(gp5)3-(gp27)3 
central spike complex is a superposition of its two non-equivalent posi- 
tions related by a 60° rotation around the six-fold axis of the baseplate. 
Nevertheless, the superimposed 8-strands of the tandem tube domains 
of gp27 are perfectly resolved, confirming its three-to-six-fold symme- 
try adaptor function, as proposed in an earlier study’. Thanks to the 
prominent lysozyme domain of gp5, placement of the trimeric part of 
the spike in its two possible positions is unambiguous. 

The tail tube is a smooth continuation of the spike complex, and 
gp48 and gp54 form its first two annuli (Fig. 3a). The tube body, as 
expected’, is made of stacked gp19 hexamers. All the three proteins 
have a fold common to tube proteins from other contractile injection 
systems (for example, T6SS Hcp (ref. 11) or PA0623 of R-type pyocin"’). 
The electron density is of sufficient quality to build the atomic model 
of the first two gp19 rings, allowing us to define the symmetry of the 
tube. Its helical rise and twist are 40.2 A and 17.9°, respectively. These 
parameters are very close to those of the extended sheath determined 
in previous, lower-resolution studies’ and probably represent the true 
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Figure 3 | Structure of the conserved inner baseplate. a, The minimal 
composition of a contractile injection system is derived from the T4 tail 
structure: the central spike complex (gp5, gp5.4, gp27), the conserved part 
of the wedge (gp6, gp7, gp25, gp53), the tail tube (gp19, gp48, gp54) and 
the conserved part of the sheath (gp18), some of which is modelled using 


symmetry of the extended T4 tail. Gp54 and gp48 obey the helical sym- 
metry of the tube. The rise and twist of superposition of the gp54 ring 
onto the gp19 and gp48 hexamers are 39.9 A and 17.2°, and —38.8A 
and —22.1°, respectively. 

In the complete tail, gp54 interacts with the first layer of sheath subunits 
(Fig. 3a), suggesting that gp54 is involved in the initiation of sheath 
assembly. However, most contractile injection systems lack gp54 
orthologues and their first sheath layer must therefore bind to the main 
tube protein. Thus, the baseplate and the universally conserved gp25-like 
protein’ in particular play a more important role in the initiation of 
sheath assembly in the extended state, as discussed further below. 


Higp7 DOogpsA Ggp8B gpa 


Hgp10A gp10B Ggp10C Hgpt1 Ogpt2 


gp9 trimer 


gp12 trimer 


Figure 4 | The intermediate baseplate and the tail fibre network. Shown 
are gp7, gp8 (surface representation), gp9, gp10 (surface representation), 
gpll and gp12. A symmetry-related gp12 trimer is displayed semi- 
transparently. Insets: 1) gp10-gp10 and gp10-gp7 disulfide bridges; 2) 
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the pyocin sheath"*. b, A close-up of the structure of the conserved region 
of the wedge consisting of the (gp6).-gp7 heterotrimer, gp25 and gp53. 
The EPR motif of gp25 and the LysM motif of gp53 are highlighted with 
semi-transparent grey. 


Notably, T6SS gene clusters do not appear to contain orthologues of 
either gp48 or gp54 (Supplementary Table 3), presumably leaving both 
functions to Hcp, the main tube protein of T6SS. Hcp could acquire 
new functions by binding an adaptor or effector protein to its ‘dough- 
nut-hole’ surface*’. On the other hand, bacteria often carry several 
copies of hcp genes that can encode products with different functions. 


The tail fibre network 

The T4 baseplate distinguishes itself from its orthologues by being 
equipped with a very complex tail fibre network (Fig. 4). It consists of a 
gp9 trimer ((gp9)3) to which a long tail fibre (LTF) comprising (gp34)s, 


B-mercaptoethanol 


ve 


gp/-gp10 


SDS-PAGE of T4 baseplates in reducing and non-reducing conditions 
(the full gel is shown in Supplementary Fig. 1); 3) close-up of the 
interaction of gp7 with gp8. Residues 890-918 of gp7 form a long 8-strand 
that runs across the surface of the gp8 dimer end-to-end. 
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Pre-attachment 


Figure 5 | Conformational change of the T4 baseplate upon host 
cell attachment. The left and right panels show the pre- and post- 
attachment structures that are derived from cryo-EM data. The middle 
image is a model of an intermediate with assumptions described in the 


(gp35),, (gp36)3 and (gp37)3 is attached, and a (gp10)3-(gp11)3-(gp12)3 
module, in which (gp12)3 is the short tail fibre (STF). The (gp10)3- 
(gp11)3-(gp12)3 network forms a spring-like mechanism that extends 
upon interaction of the phage with a host cell. Our baseplate-tail tube 
complexes represent an intermediate of phage particle assembly and 
thus lack LTFs (except for a few aberrantly bound ones) because LTFs 
attach to otherwise complete phage particles at the very end of the par- 
ticle assembly pathway”. The cryo-EM density of the LTF-attachment 
protein gp9 is weak but could nevertheless be easily interpreted with 
the help of its crystal structure*! (Extended Data Fig. 2i). 

Gp10 is key to understanding the function and evolution of the tail 
fibre network (Extended Data Fig. 4). This trimeric molecule has a 
distorted letter X shape, but each of its four domains (D1-D4) displays 
clear three-fold symmetry. A chain order swapping event takes place 
between D2 and D3 (Extended Data Fig. 4c). D1 is homologous to 
the N-terminal part of gp9 and this relationship could be detected at 
the level of the amino acid sequence. Unexpectedly, however, gp10 D2 
and D3 are structurally similar to each other. They interact with the 
N-terminal motifs of gp12 and gp11, respectively, which themselves are 
also homologous (Extended Data Fig. 5a). Moreover, each of these units 
has a common three-fold axis (for example, gp10 D2-gp12) that orients 
the axes of the gp11 and gp12 trimers to be roughly perpendicular to 
each other. The X-like shape of gp10 allows each gp10-gp11 complex 
to interact with two STFs in the baseplate. One of the STFs extends D2 
of gp10, and the other wraps around gp11 as described below. 

Gp10 domains D1 and D4 attach the (gp10)3-(gp11)3-(gp12)3 module 
to gp7 in the intermediate baseplate. Our structure shows, and mass spec- 
trometry confirms, that apart from non-covalent interactions, the gp10 
D4-gp7 interface involves a disulfide bridge between Cys555 of one gp10 
chain and Cys184 of gp7 (Fig. 4). The Cys555 residues of the two remain- 
ing gp10 molecules form a disulfide bond (Extended Data Fig. 5b). 
This finding explains the gp10-gp10 inter-chain disulfide bridge found 
in the recombinant (gp10)3-(gp11)3 complex*’. The gp7-gp10 and 
gp10-gp10 crosslinks are clearly seen in purified baseplates (Fig. 4). 
Cys555 of Gp10 and Cys184 of gp7 are conserved in all T4-like phages 
(Extended Data Fig. 5c, d), making the covalent attachment of the tail 
fibre network to the intermediate baseplate a hallmark of these viruses. 

Using a multistep modelling procedure (see Methods), we were able 
to build the complete structure of the STF (Fig. 4 and Extended Data 
Fig. 6). Its previously unknown part (residues 2-245)**** contains six 
repeats (Extended Data Fig. 6a—c) that follow the N-terminal gp11-like 
motif (residues 2-47; Extended Data Fig. 5a). Two kinks allow the fibre 
to fold neatly around the periphery of the baseplate. The first bend 
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text. The insets show close-up views of the central part of the baseplate 
demonstrating a release of the tail tube-central spike complex, whose 
position is kept unchanged throughout the transformation. 


(approximately 30°) is between repeats 1 and 2 where the fibre contacts 
the C-terminal domain of a symmetry-related mate. The second, much 
sharper bend (approximately 90°) is between repeats 3 and 4 where the 
fibre wraps around the gp11 trimer. In this region, the gp11-proximal 
chain of gp12 folds into an a-helix whereas the other two chains adopt 
an extended conformation. 

The structural data presented above show that all trimeric proteins 
comprising the baseplate periphery and fibres contain structurally 
related domains (Extended Data Fig. 6d). Thus, despite their different 
functions in the T4 particle, it is possible that all these proteins evolved 
from a common ancestor by multiple gene duplication events. Notably, 
simpler baseplates contain only one trimeric protein (the host receptor 
binding fibre) that is attached to a (gp6).-gp7-like unit!. 


The inner baseplate is universally conserved 

The structure of the T4 baseplate allows us to define the composition 
of baseplates in other contractile injection systems. Using HMM pro- 
files and structures of T4 baseplate proteins in HHsuite***, we found 
orthologues of all inner baseplate components in all systems except for 
a gp53-like component in T6SS (Supplementary Table 3). This com- 
ponent could nevertheless constitute a domain in one of the T6SS core 
bundle proteins (TssF or TssG) as both are larger than some of their 
phage orthologues. Such a rearrangement of functional domains is not 
uncommon and, for example, gp102 of bacteriophage A511 combines 
the functions of gp25 and gp53 into a single protein (Supplementary 
Table 3). 

To test the validity of our bioinformatics-based assignment of func- 
tions, we examined the assembly of the T6SS baseplate, as it appears 
to be more distantly related to T4 than other systems. We found that, 
similar to T4, the assembly pathway of the T6SS baseplate involves a 
wedge-like intermediate that contains a (gp6).-gp7-like heterotrimer 
(Extended Data Fig. 7a—d; for additional details regarding T4 baseplate 
assembly, see Supplementary Information). The T6SS orthologues of T4 
gp25, gp6 and gp7 (TssE, TssF and TssG, respectively) form a complex 
with TssK that has a stoichiometry of (TssE),—(TssF)—(TssG)-(TssK)3 
(Extended Data Fig. 7e, f). No complex was detected in the absence 
of TssK. 

Considering the strength of bioinformatic predictions and the bio- 
chemical data, we conclude that the TssE-(TssF)2-TssG unit forms the 
core bundle, the trifurcation unit, and the sheath polymerization plat- 
form in the T6SS baseplate. Furthermore, similar to the function of the 
gp10 trimer in T4 (ref. 23), trimeric TssK (ref. 47) appears to initiate 
the assembly of the T6SS wedge. 


© 2016 Macmillan Publishers Limited. All rights reserved 


Conformational switch of the baseplate 

Although the structure of a single state of a macromolecular machine 
as complex as the T4 baseplate can provide a plethora of information, 
to understand fully how it works one needs to describe the machine in 
its multiple states. Fortuitously, a small fraction of the baseplates in our 
study switched to the post-host attachment state (Extended Data Fig. 1). 
We selected all such particles and calculated a reconstruction to 6.8A 
resolution. Virtually all domains from the pre-attachment hexagonal 
baseplate fit into this map as rigid bodies (see Methods). One nota- 
ble exception is a loop formed by residues 841-862 of gp7. Cryo-EM 
density convincingly places this loop on two opposite sides of the gp7 
trifurcation unit insertion domain in the two conformations (Extended 
Data Fig. 8a, b), indicating that it performs a ‘jump rope’ type of motion 
around this domain during the conformational switch. The location 
of this loop in our post-attachment structure is similar to that found 
in the 3.8 A resolution reconstruction of the recombinantly assembled 
star-shaped baseplate. 

To better understand and visualize structural changes that relate the 
two conformations, we aligned the two structures using gp53 molecules 
as a reference (see Methods) and performed interpolation with the help 
of UCSF Chimera’? (Fig. 5 and Supplementary Video 1). This proce- 
dure resulted in intermediates with no grave clashes or chain breaks 
although it did not work for the STFs or the jump rope loop of gp7. 
Several intermediates had to be created as described in Methods. 

All baseplate proteins participate in the conformational switch (Fig. 5 
and Supplementary Video 1). The tail fibre network displays the largest 
amount of motion. The STFs unravel and straighten, as can be seen 
in raw images of baseplates attached to membranes (Extended Data 
Fig. 2j). The multivalent structure of the gp10 trimer plays a critical 
role in this transformation. Three-fold symmetrical gp10-gp11 and 
gp10-gp12 interfaces are preserved in the two conformations, sug- 
gesting that the gp10-gp11-gp12 complex rotates as a unit during the 
transformation. At the end of this rotation, the N-terminal domain of 
gp12 points straight towards the host cell surface. On the other hand, 
many interactions that lack three-fold symmetry in the pre-attachment 
state are broken: gp11 dissociates from the gp12 fibre that was origi- 
nally wrapped around it (gp11 then binds the LTF protein gp34 in the 
post-attachment baseplate*); and the C-terminal domain gp12 disso- 
ciates from a symmetry-related gp12 fibre and binds to a receptor on 
the host cell surface. 

The reorganization of the inner baseplate can be best understood by 
examining the motion of (gp6)2.-gp7 heterotrimers (Extended Data 
Fig. 9 and Supplementary Videos 2, 3). Gp7 transfers the rotation of the 
gp10-gp11-gp12 module to the (gp6).-gp7 heterotrimer, causing the 
latter also to rotate as a unit and tilt its three-fold axis by about 30°. As 
a result, the diameter of the (gp6)2-gp7 iris increases and the proteins 
bound to the core bundle (gp25 and gp53) dissociate from the tube 
and the hub (Fig. 5), preparing the latter for subsequent sheath-driven 
translocation. 

To analyse how the relative positions of different baseplate domains 
influence the baseplate conformational change and phage infectivity, we 
subjected gp7 to site-directed mutagenesis by extending or shortening 
some of its domain-linking regions by one amino acid (Extended Data 
Fig. 8 and Supplementary Information). We also attempted to remove 
the gp7-gp10 disulfide bridge. The region preceding the insertion 
domain of the trifurcation unit did not tolerate the two desired muta- 
tions, whereas small changes in the length of the linker upstream of the 
trifurcation unit and the jump rope loop did not affect phage infectivity 
(Extended Data Fig. 8). On the other hand, removal of the gp7-gp10 
covalent linkage had a profound effect on infectivity, confirming that 
this bond is important for maintaining the structure of the baseplate 
during the conformational switch. 

To extrapolate our results to an even larger framework, we used all 
available structural and functional data'”® to construct an atomic 
model of the T4 tail in the extended and contracted states (see Methods, 
Extended Data Fig. 10). Gp25, gp54 and the gp6B-gp53 platform create 
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a complex interface that promotes the assembly of the first layer of 
the sheath in the extended conformation (Extended Data Fig. 10a). 
Gp25 serves as a nucleus for sheath polymerization because the struc- 
ture and symmetry of the gp25-gp48 complex are repeated in every 
sheath-tube subunit interaction. In the assembled structure, gp25 
is a seamless continuation of the lattice formed by gp25-like tube- 
interacting domains of the sheath. To achieve this, gp25 must accept 
the long N- and C-terminal arms of the first sheath subunit'**??, thus 
firmly attaching the sheath to the baseplate (Extended Data Fig. 10; for 
additional details see Supplementary Information). In the contracted 
tail and the post-attachment baseplate, an outward motion and rota- 
tion of the core bundle with the associated proteins (gp25 and gp53) 
causes an even larger outward motion and around 45° rotation of sheath 
subunits’ (Extended Data Fig. 10 and Supplementary Video 2). The 
sheath appears to maintain its original interaction with gp25, but now 
finds a different, presumably more energetically favourable, interface on 
the gp6B-gp53 platform (Extended Data Fig. 10b). Thus, in agreement 
with its high conservation in contractile injection systems’, gp25 plays 
a critical role in sheath assembly and contraction. 

The structural transformation of the baseplate described here 
involves proteins that are universally conserved in all contractile injec- 
tion systems (except for the unknown T6SS gp53 orthologue). A similar 
sequence of events is likely to occur in any such system, regardless of 
the complexity of its peripheral baseplate or tail fibre network. Thus, 
the conformational switch consists of the following steps: an exter- 
nal factor triggers reorganization of the peripheral baseplate; the gp7 
orthologue transfers this signal to a gp25- (gp6).-gp7-like module; and 
this module reorients, releasing the hub complex while simultaneously 
initiating sheath contraction. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Purification of the T4 baseplate-tail tube complex. We purified T4 baseplates by 
improving a protocol established in a previous study”* by implementing new tools, 
resins and media (Extended Data Fig. 1a). This protocol involves a phage mutant 
that carries the amber stop codon in the 5’ region of the sheath and major capsid 
protein genes (T4-18am-23am). T4 particle assembly is ordered and infection of 
a non-amber-suppressing (Sup®) cell strain by such a mutant results in accumu- 
lation of assembly intermediates up to the point of incorporation of the defective 
gene product (that is, baseplate—tail tube complexes and fibres in the case of 
T4-18am-23am; Extended Data Fig. 1b, c). 

Six litres of Sup? Escherichia coli B® cells were grown at 37°C to an attenuance 
of 0.6 at 600 nm (Ago = 0.6) and infected by T4-18am-23am with a multiplicity 
of infection (M.O.I. or phage-to-cell ratio) of 5. Eight minutes later, the same 
amount of phage particles were added, resulting in lysis delay”! or ‘superinfection. 
The infected cells were incubated at 30°C for an additional 150 min, collected by 
centrifugation and flash frozen in liquid nitrogen. The cells were then thawed 
and resuspended in 120 ml of lysis buffer (50 mM Tris-HCl pH 8.0, 1mM MgCh, 
30g ml~* DNase I, 100j1g ml~* lysozyme), and incubated at 30°C for 30 min. 
Then, an equal volume of RNase buffer (50 mM Tris-HCl pH 8.0, 40 mM EDTA) 
containing RNase at 501g ml~' was added, and the cells were incubated at 30°C 
for an additional 30 min. 

This sample was concentrated to ~20 ml with the help of Centricon Plus-70 
(30kDa cut-off) filtration devices (Merck Millipore) and centrifuged at 50,000g 
overnight. The pellet was resuspended in purification buffer (50 mM Tris-HCl 
pH 8.0, 1mM MgCl,), loaded on a sucrose gradient (5-30% (w/v) with a 50% cush- 
ion in the same buffer), and centrifuged at 35,000g for 4h. Fractions containing 
baseplates were pooled, diluted with the same buffer and centrifuged at 50,000g 
overnight. The pellet was resuspended in SM8 buffer (50 mM Tris-HCl pH 8.0, 
100mM NaCl, 8mM MgSO,), loaded on an OptiPrep (PROGEN Biotechnik) gra- 
dient (10-40% (w/v) with a 50% cushion) buffered by SM8 buffer and centrifuged 
at 35,000g¢ for 24h. Fractions containing baseplates were pooled and subjected to 
size-exclusion chromatography to remove the OptiPrep compound. A Superose 6 
Increase 10/300 GL column (GE Healthcare Life Sciences) pre-equilibrated with 
SM8 buffer was used. Peak fractions were pooled and centrifuged at 50,000g for 4h. 
The pellet was resuspended in about 20011 of SM8 buffer, to a final concentration 
of around 1mg ml. 

Cryo-EM sample preparation and data acquisition. T4 baseplates were applied 
to glow-discharged, 300-mesh, copper Quantifoil grids (Quantifoil). The grids 
were blotted for 3s and vitrified by plunging into liquid ethane cooled down to 
liquid nitrogen temperature using a Vitrobot Mark IV (FEI). Cryo-EM samples 
were imaged using a K2 summit direct electron detector (Gatan) on an FEI Titan 
Krios at an acceleration voltage of 300kV and equipped with a Gatan image filter 
(slit width 20 eV) (Supplementary Table 2). Images were collected at a defocus 
of around 1.5\1m and recorded in dose fractionation mode (40 frames) at a dose 
rate between 6 and 8 e” per pixel per s, dose per image sub-frame between 1 and 
1.5e° A~? and a cumulative dose between 40 and 60 e~ A~’. The effective pixel size 
of the images was 1.32 A per pixel at a nominal magnification of 105,000 x. Frames 
3-40 were aligned by motion correction” as implemented in 2dx_automator”’. 
Cryo-EM data processing. For all 1,699 micrographs, we calculated the defo- 
cus parameters using CTFFind4 (ref. 54) and discarded 78 micrographs that 
fell outside a 0.5—-4-\1M defocus window, had a cross-correlation of lower than 
0.006 or did not have Thon rings extending to at least 8 A. 47,516 pre-attach- 
ment and 6,900 post-attachment particles were picked using e2boxer.py in 
EMAN2 (ref. 55). 

For the pre-attachment baseplate, particles were classified into 250 classes using 
reference-free 2D classification with RELION 1.4 (ref. 56). The 47,362 particles 
belonging to the best 41 classes were regrouped into 421 groups, and used to cal- 
culate the initial reconstruction using RELION 3D auto-refine. The previously 
published 12 A resolution baseplate-tail tube complex (EMDB code 1048)*° was 
filtered to 60 A and used as an initial reference. This procedure gave a reconstruc- 
tion with a resolution of 5.35 A before and 4.36 A after postprocessing employing 
automatic masking, with an empirically optimized binarization threshold of 0.11, 
automated B-factor sharpening and correction of the modulation transfer func- 
tion (MTF)°’. To further improve the resolution, we performed alignment of the 
individual particle movie frames (‘particle polishing’), using 47,297 particles after 
discarding those that had missing frames. This was followed by a new refinement 
using these polished (‘shiny’) particles. We regrouped the particles into 19 groups 
and repeated the 3D refinement. Then, particles with a maximum probability dis- 
tribution value lower than 0.09 were discarded and the final round of refinement 
was run using 37,913 particles. This resulted in a reconstruction with a resolution 
of 4.65 A. This map was postprocessed using automatic masking, an empirically 
optimized binarization threshold of 0.11, automated B-factor sharpening and 
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correction of the modulation transfer function (MTF). The postprocessed map 
had an overall resolution of 4.11 A (ref. 57). 

To perform localized reconstructions, we low-pass filtered the map from the 
final round of refinement to 8 A, generated local masks by segmentation with 
Segger’® in UCSF Chimera”, and processed them with relion_mask_create. The 
subsequent refinement was performed by applying these local masks*, similar 
to Yan et al.*°. This procedure resulted in maps with generally improved inter- 
pretability for most regions of the baseplate, with the exception of the central 
non-six-fold-symmetric central spike and the gp9 protein, which displays a very 
high conformational heterogeneity. These maps after postprocessing, and their 
FSC curves, are shown in Extended Data Fig. 2g, h. 

Reconstruction of the post-attachment baseplate was initiated by grouping all 

images into 70 classes with RELION while limiting the high-resolution terms to 
10 A and performing phase flipping instead of the full CTF correction. The 5,188 
particles belonging to the best 25 classes were selected and regrouped into 25 
groups. These particles were then used for RELION 3D refinement. We created 
the starting model by extracting the baseplate from the previously published 17 A 
reconstruction of the contracted T4 phage tail (EMDB code 1089)! with the help 
of Segger®® in UCSF Chimera’’. This map was filtered to 60 A and subjected to the 
RELION 3D auto-refine procedure, resulting in a 9.50 A resolution reconstruc- 
tion. Postprocessing with an optimized binarization threshold of 0.15 improved 
the resolution to 7.49 A. A total of 5,176 particles had complete stacks of movie 
frames, which allowed us to ‘polish’ them by aligning individual movie frames. The 
resulting map had a resolution of 6.77 A. We further improved the interpretability 
of this map by postprocessing using the mask from the polishing procedure, MTF 
correction and automatic B-factor correction. 
Model building: pre-attachment baseplate. All available crystal structures of 
baseplate proteins and their domains?*°4)°-44"?-°! were fitted into the best post- 
processed map using Coot® (Supplementary Table 1). Reliable homology mod- 
els were generated with the help of HHpred®™ and Phyre2™, fitted into the map 
with Coot and rebuilt. The rest of the models were built de novo also in Coot. 
Ramachandran restraints were used for local real space refinement, which was 
performed against the postprocessed overall map or the postprocessed localized 
refinement maps (which were aligned to the postprocessed overall map, and res- 
ampled using Chimera). For the tail tube protein gp19, we generated a model 
using MODELLER® based on HHalign* alignment with gp54. Finally, taking 
advantage of the technological development of the last 15 years®°, we re-refined 
the published structure of the gp12 fragment*. The improved electron density 
allowed us to extend the existing model and complete the structure of the gp12 
repeat unit (residues 220-253; Extended Data Fig. 6b, c). This structure was then 
used to model the other repeats with MODELLER, imposing three-fold symmetry 
restraints. The N-terminal residues of gp12 were modelled on the basis of align- 
ment with the gp11 N-terminal residues found by HHpred, using MODELLER 
and three-fold symmetry restraints. 

The cryo-EM map of gp10 domains 2 and 3 was one of the most challenging 
regions for model building. Therefore, we used mass spectroscopy to verify the 
de novo built model. A disulfide bond between cysteines 331 and 342 was first 
identified in our model and then confirmed by mass spectrometry, indicating the 
correctness of the atomic model even in difficult parts of electron density. 

Model building: post-attachment baseplate. The model of the post-attachment 
baseplate was constructed with the help of the refined coordinates of the pre- 
attachment baseplate in the following way. Initially, the inner baseplate (gp6 (residues 
2-104, 170-278 and 317-411), gp7 (residues 638-753), gp25 and gp53) was fitted 
as a rigid body. Then, gp8 and gp9 were located and the remaining domains of gp6 
and gp7 were fitted with the help of the secondary structure elements visible in the 
map. Residues 841-862 of gp7 (the jump rope loop) had to be completely rebuilt. 

In the peripheral baseplate, the following modules were fitted as rigid units: 
gp10 domain 1, (gp10 domain 2)-(gp12 N-terminal domain), (gp10 domain 
3)-gp11, and gp10 domain 4. Subsequently, linker regions and a few loops were 
remodelled by rigid body fitting, regularization and real space refinement. 

The final refined model does not contain gp12. The structure of the extended 
gp12 STF is obtained by assuming that the interaction between gp10 domain 2 and 
the N-terminal domain of gp12 is preserved and that gp12 simply straightens upon 
interaction with the host cell membrane (Extended Data Fig. 2)). 

Refinement. Both structures (pre- and post-attachment baseplate) were refined 
with phenix.real_space_refine™ using global real space refinement and reciprocal 
space B-factor refinement (two B-factors per residue; Supplementary Table 2). 
The following restrains were applied in the refinement: standard Cg, secondary 
structure, Ramachandran, global six-fold symmetry and local two- and three-fold 
symmetry for the following proteins: gp9, gp11, and gp19. In the pre-attachment 
structure, the central spike (gp5.4);-(gp5)3-(gp27)3 was modelled as alternate 
conformers in its six different symmetry-related positions. The pre-attachment 
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structure served as a reference model in the refinement of the post-attachment 
structure providing additional restraints. 

To detect overfitting, we performed a limited random displacement of atoms 
in the final model with a mean of 0.5 A and refined it against one of the two maps 
from the final round of cryo-EM image processing, each calculated with half of 
the data. Both models showed similar agreement with both maps confirming that 
no overfitting took place®* (Extended Data Fig. 2c, f). 

Baseplate conformational change modelling. The post-attachment baseplate 
was aligned to the pre-attachment baseplate by least-squares superposition of the 
centres of mass of six gp53 molecules in the two conformations using LSQKAB® 
from the CCP4 suite of programs”’. Then, we calculated a morph between the 
pre-and post-attachment baseplate by means of the corkscrew interpolation 
algorithm as implemented in UCSF Chimera*’. Only two regions of the entire 
baseplate structure had physically impossible intermediates: the three chains com- 
prising the STF had independent trajectories resulting in their spatial separation, 
and the jump rope loop of gp7 (residues 841-862) overlapped with other atoms 
of the gp7 trifurcation unit insertion domain (residues 761-862). Splitting the STF 
into two separate trimeric parts comprising residues 2-98 and 99-527 resulted in 
a trajectory in which all three chains maintained their native association, but the 
fibre was now separated at its sharp kink region (residues 180-185). To overcome 
this problem, we created intermediates in which the kink region was manually 
straightened and regularized. Finally, the two ends of the jump rope loop of gp7 
were rebuilt to allow its large motion around the trifurcation unit insertion domain, 
and the insertion domain had to be slightly displaced as a rigid body to accom- 
modate this movement. 

Pseudoatomic tail models: extended tail. The pseudo-atomic model of the T4 tail 
presented in Extended Data Fig. 10 was produced by fitting the structures—as rigid 
bodies—of tail proteins obtained in this and other studies™!%?7°4143-449099-61,71 
into the cryo-EM map of the extended T4 tail (EMBD code 1126)? with the help 
of UCSF Chimera”. A crystal structure of the sheath protein gp18 lacking gp25- 
like Domain I' and a part of Domain II’ is available (PDB code 3FOA)”’. We used 
HHpred®, MODELLER®, and the structure of the R-type pyocin sheath (PDB 
code 3J9Q)'* to model the missing fragments giving rise to a nearly complete 
atomic model of the sheath protein subunit. This structure was then fitted into the 
tail cryo-EM map. The tail tube was constructed by repetitively applying the sym- 
metry transformation relating the first two gp19 hexamers in the atomic structure 
described in this work. The atomic model of the tail tube terminator protein gp3 
(ref. 72) was derived from the structure of gp19 using the same tools®*® and was 
assumed to obey the overall symmetry of the tube. 

Pseudoatomic tail models: contracted tail. Several cryo-EM reconstructions of 
the T4 tail in the contracted state are available (EMDB codes 1086, 1089, 5528)*””, 
Of these, the highest resolution (EMDB code 1086) is at 17A resolution’, but it 
lacks the fibritin whisker proteins, which are present in a more recent study that 
has a resolution of 25 A (EMDB code 5528). To facilitate the interpretation of 
these complex maps, they were brought to a common origin with the structure 
of the post-attachment baseplate (which was itself aligned to the pre-attachment 
baseplate, as described above). We then fitted the atomic structure of the post- 
attachment baseplate, all the sheath subunits, and the tube from the extended 
tail into the 17 A resolution map. The tail- and tube-terminating proteins (gp15 
and gp3, respectively), were assumed to preserve their relative orientation in the 
contracted tail, thus defining the position of the tube. The fibritin model was then 
taken from the 25 A resolution study. 

Purification, crystallization and structure determination of T4 gp25. Our origi- 
nal attempt to express gp25 resulted in an insoluble protein (Extended Data Fig. 7g, 
lanes 2 and 3). To produce gp25 in soluble form, we expressed it as a downstream 
fusion to SlyD, a known metal-affinity column contaminant, peptidyl-prolyl 
isomerase and expression enhancer’* (Extended Data Fig. 7g, lanes 4-8). The 
SlyD tag carried a polyhistidine sequence at its N terminus to increase its affinity 
to IMAC resin. The linker between SlyD and gp25 contained a cleavage site for the 
tobacco etch virus (TEV) protease. We used the TEV protease with a polyhistidine 
tag at its C terminus that we produce recombinantly in-house (Extended Data 
Fig. 7g, lane 9). The vector with the TEV-cleavable His-SlyD expression tag was 
derived from pET-23d(+) (Novagen). We named it pTSL and deposited it to the 
NCBI database under accession number KU314761. 

For cloning into pTSL, gene 25 was PCR amplified using the primers given in 
Supplementary Table 4 and T4 DNA as a template. The expression vector was trans- 
formed into methionine-auxotrophic B834/DE3 cells with ampicillin as a selection 
marker. Protein expression was performed in 2 x TY medium supplemented with 
ampicillin at 200 mg ml~'. The plasmid-carrying cells were grown to Agoo of 0.6 
and induced by addition of IPTG to a final concentration of 0.5mM (Extended 
Data Fig. 7g, lanes 4, 5). The expression was performed at 18°C with rigorous 
shaking overnight. The cells were pelleted at 8,000g, resuspended in lysis buffer 


(50 mM Tris-HCl pH 8.0, 300 mM NaCl), and disrupted by sonication at 4-16 °C. 
The insoluble fraction was removed by centrifugation at 27,000g for 20 min. The 
clarified cell lysate was loaded onto a 5-ml GE HisTrap FF Ni-charged column, 
and the protein was eluted with imidazole-containing buffer (50 mM Tris-HCl 
pH 8.0, 300 mM NaCl, 200 mM imidazole) using a step gradient (Extended Data 
Fig. 7g, lane 6). Fractions containing the protein were pooled and the TEV pro- 
tease was added to give a protease/protein ratio of 1/100 (w/w). This mixture was 
dialysed into 10 mM Tris-HCl pH 8.0 by overnight incubation at 20°C resulting 
in cleavage of the His—SlyD expression tag (Extended Data Fig. 7g, lane 7). The 
digested protein was filtered and purified by ion-exchange chromatography using a 
MonoQ 10/100 GL column and 0 to 1 M NaCl gradient in 20mM Tris-HCl pH 8.0 
buffer. Fractions containing gp25 were pooled and loaded onto a Superdex 75 
HiLoad 16/60 size-exclusion column equilibrated with 10 mM Tris-HCl pH 8.0 
and 150mM NaCl. 

For crystallization, the protein was dialysed into crystallization buffer (20 mM 

Tris-HCl at pH 8.0, 50mM NaCl), concentrated to 20 mg ml~’, and crystallized 
in 0.9-1.0M MgSO,, 100mM MES pH 6.2. The structure was solved by the 
single-wavelength anomalous dispersion method with the help of a 
SelenoMethionine (SeMet) derivative. The latter was produced using the 
SelenoMethionine Medium Complete kit (Molecular Dimensions). The SeMet 
derivative was purified and crystallized in a similar way to the native protein. 
Crystallographic data were collected at the Swiss Light Source PX-I (X06SA) beam- 
line using a PILATUS-6 M (DECTRIS Ltd.) pixel detector. For data collection, a 
crystal was briefly washed in cryoprotectant solution that contained all the com- 
ponents of the well solution at the same concentrations and 25% ethylene glycol 
and flash frozen in a vaporized liquid N2 stream at 100 K. The X-ray data were 
processed with MOSFLM”“, SCALA” and POINTLESS” (Supplementary Table 5). 
The selenium atoms were located with the help of SHELXD”’. The atomic model 
was built and refined with COOT® and PHENIX”. 
Purification of TssE-TssF-TssG-TssK complex. Recombinant expression of 
E. coli CFT073 gene c3400, which was proposed to encode a TssF-TssG fusion”, 
results in two proteins with molecular masses of ~65 kDa and ~40kDa. The 
sequence of CFT073 ‘gene’ c3400 is identical to that of genes CP009072 of E. coli 
ATCC 25922 and CP007799 of E. coli Nissle 1917, except for two missing G nucleo- 
tides after positions 3245777 and 3245794. We resequenced this fragment and 
confirmed that the missing nucleotides represent sequencing or genome assembly 
errors. We named the two correct genes c3400.1 and c3400.2. These genes encode 
proteins that show 100% sequence identity to E. coli TssF and TssG with NCBI 
references WP_000342476.1 and WP_000373314.1, respectively. 

Two PCR product-containing E. coli CFT073 genes, c3402 (TssE) and c3400. 1 
(TssF), c3400.2 (TssG) were fused by overlapping PCR using the primers given in 
Supplementary Table 4 to create a single construct with c3402 upstream of c3400. 1 
and c3400.2. The resulting PCR product was cloned into a modified pBAD33 
vector that resembled the pTSL vector described above. TssE carried a TEV pro- 
tease-cleavable His-SlyD tag at its N terminus. Gene c3387 (TssK) was cloned into 
the vector pBAD24 without any tags. Both plasmids were co-transformed into 
E. coli TOP10 cells using appropriate antibiotic selection. These cells were then 
used for protein expression. 

Protocols used for expression and purification of the TssE-TssF-TssG-TssK 
complex were similar to those used for gp25 with small modifications. The 2 x TY 
medium was supplemented with ampicillin at 200 mg ml~! and kanamycin at 
50 mg ml~'. The plasmid-carrying cells were grown to an Agoo of 0.8 and induced 
with arabinose to a final concentration of 0.1% (w/v). The expression was per- 
formed at 18°C with rigorous shaking overnight. The cells were pelleted at 8000g, 
resuspended in lysis buffer (50 mM Tris-HCl pH 8.0, 300 mM NaCl), and dis- 
rupted by sonication at 4-16°C. The insoluble fraction was removed by centrifu- 
gation at 27,000 g for 20 min. The clarified cell lysate was loaded onto a 5-ml GE 
HisTrap FF Ni-charged column. The protein was eluted with imidazole-containing 
buffer (50 mM Tris-HCl pH 8.0, 300 mM NaCl, 200 mM imidazole) using a step 
gradient. Fractions containing the protein were pooled and the TEV protease was 
added to give a protease/protein ratio of 1/100 (w/w). This mixture was dialysed 
into 10mM Tris-HCl pH 8.0 by overnight incubation at 20°C resulting in cleavage 
of the His—SlyD expression tag. The digested protein was filtered and purified 
by ion-exchange chromatography using a MonoQ 10/100GL column and 0 to 
1M NaCl gradient in 20 mM Tris-HCl pH 8.0 buffer. Fractions containing the 
TssE-TssF—TssG—TssK complex were pooled and loaded onto a Superdex 200 
HiLoad 16/60 size-exclusion column equilibrated with 10 mM Tris-HCl pH 8.0 
and 150mM NaCl. 

The TssE-TssF-TssG—TssK complex had an apparent molecular mass of 260 kDa 
(90% confidence interval is 203-331 kDa) in size-exclusion chromatography on 
a Superdex 200 10/30 GL column in 10mM Tris-HCl, pH 8.0, and 150 mM NaCl 
(Extended Data Fig. 7e, f). 
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Construction of T4-7am mutant phage. T4-7am mutant phage was constructed 
by replacing Ser45 of gene 7 with the amber stop codon by means of homologous 
recombination with a plasmid carrying this mutation. First, an overlap PCR with 
primers 7S45AmF, 7S45AmRM, 7S45AmFM and 7S45AmR (Supplementary 
Table 4) and T4 genomic DNA as a template was performed as described else- 
where®’. The resulting PCR product was then ligated into the pGem-T vector 
(Promega) following the manufacturer’s recommendations, giving rise to a 
pG7-45Am plasmid that carried the desired mutation. 

For homologous recombination, pG7-45Am was electroporated into the amber 

suppressor supD E. coli strain CR63 (ref. 81) (serU60(AS)), and the cells were 
infected with phage T4 with a M.O.I. of 5. After the completion of lysis, the lysate 
was diluted to obtain an appropriate plaque count when plated onto a CR63 lawn. 
Several hundred plaques were tested for their ability to grow on CR63 and Sup® BF 
lawns and a T4-7am mutant phage was found. The presence of the mutation was 
confirmed by Sanger sequencing. 
Complementation assay. The complementation was performed by infecting B® 
cells, which expressed the wild-type gp7 protein and its mutants, with the T4-7am 
mutant phage. First, gene 7 was PCR amplified using T4 genomic DNA as a 
template and primers T4-7F1 and T4-7R1 (Supplementary Table 4). The PCR 
fragment was then cloned into the pBAD24 vector to give rise to a plasmid in 
which gene 7 was under the control of an arabinose promoter. The plasmids for 
expression of gp7 mutants were constructed by overlap extension with pBAD for- 
ward and reverse primers (Invitrogen) as flanking oligonucleotides and the primers 
given in Supplementary Table 4 as mutagenic oligonucleotides®”. The wild-type 
gene 7 plasmid served as a template for all these PCR reactions. The mutations 
were confirmed by partial sequencing, and the expression levels were analysed 
by SDS-PAGE®”. 

The plasmids expressing all variants of gp7 and an empty pBAD24 vector 
(for negative control) were transformed into the B® strain by electroporation. Night 
cultures were diluted 100 times into an LB medium supplemented with ampicillin 
(Sigma) at 100 mg ml“! and then incubated at 37°C with shaking at 2201r.p.m. 
When the cultures reached an Agoo of 0.150 they were induced by the addition of 
arabinose (Sigma) to a concentration of 0.02% (w/v). When the cells reached an 
Agoo of 0.600, T4-7am was added with a M.O.I. of 5. Superinfection was performed 
10 min later at the same M.O.I. After 4h, a few drops of chloroform were added to 
complete cell lysis. The phages were harvested and purified by centrifugation in a 
stepped gradient of caesium chloride® and then dialysed into SM buffer (50 mM 
TrisCl pH 7.5, 100 mM NaCl, 8mM MgSOQ,). The concentration of the samples 
was normalized according to their absorbance at 260 nm and then examined using 
SDS-PAGE (Extended Data Fig. 9b). The infectivity was tested by plaque assay on 
E. coli CR63 (Extended Data Fig. 9c) and E. coli B® (for negative control). 
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Extended Data Figure 1 | Sample purification, cryo-EM imaging and 
reconstruction. a, Purification scheme of the T4 baseplate-tail tube 
complexes. b, SDS-PAGE of the sample used in cryo-EM imaging 

(the full gel is shown in Supplementary Fig. 1). c, Raw cryo-EM image of 
T4 baseplates. d, e, Representative reference-free 2D class averages of T4 
baseplates in pre- and post-attachment conformations, respectively. The 
number of particles in each class in d is as follows: 1: 534; 2: 813; 3: 689; 
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4: 655; 5: 292; 6: 858; 7: 2,223; 8: 977. The number of particles in each class 
in e is as follows: 1: 315; 2: 68; 3: 494; 4: 60; 5: 263; 6: 387; 7: 62; 8: 566. 

f, g, Distribution of refined angles of the baseplate in both conformations. 
h, Fourier shell correlation (FSC) between independently refined maps 
calculated using half of the data (gold-standard refinement) after post- 
processing for both conformations. i, Fragments of the pre-attachment 
baseplate cryo-EM reconstruction map with fitted atomic model. 


© 2016 Macmillan Publishers Limited. All rights reserved 


a oe os Resolution (A) b 


10.5 


g h 

FSC 

. Tail tube 1 
0.8 
Inner baseplate « - 

Intermediate le . 
baseplate a3 

Upper peripheral 

baseplate 02 


Lower peripheral 


baseplate me 0 1/20 1/10 


ome 


Extended Data Figure 2 | Details of local resolution estimation and 
baseplates attached to membranes. a, d, Resolution of pre- and post- 
attachment reconstructions analysed with ResMap*®’. b, e, Atomic models 
of pre- and post-attachment baseplates coloured by B-factors. c, Plots of 
the model-map FSC of the scrambled pre-attachment baseplate structure 
refined against half data map 1 from the gold-standard refinement versus 
half data map 1 (the map it was refined against) and half data map 2 
(against which it was not refined). The absence of a large gap between both 
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curves indicates that no excessive overfitting took place. f, As in c, but for 
the post-attachment baseplate. g, Localized cryo-EM refinement maps of 
the pre-attachment baseplate. h, FSC graphs of the localized refinements 
in g. i, Fit of gp9 into the pre-attachment baseplate map (the contour level 
is lower than that in Fig. 1a). j, Gaussian low-pass filtered (1/25 A) raw 
images of individual baseplates attached to cell membranes selected from 
n= 243 similar images. Asterisks, baseplates; hash symbols, membranes. 
The extended STFs connecting the two can be seen. 
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See Methods. 
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Extended Data Figure 8 | Contacts of gp7 with other baseplate proteins 
and gp7 mutagenesis. a, b, Location of the gp7 jump rope loop in the 
two conformations of the baseplate. c, The structure of gp7 is coloured 

by amino acid number from blue N terminus to white C terminus. The 
inset shows the position of gp7 (coloured blue) within the baseplate map. 
Interactions of gp7 with other baseplate proteins are shown schematically 
using the colour code of Fig. 1. The superscript i — 1 denotes interactions 
with a symmetry-related copy of a given protein. Red letters indicate sites 
of mutagenesis (for example, d/i636: deletion/insertion at position 636) 
that resulted in viable phage particles (see Supplementary Information). 
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The purple label shows the site that tolerated neither residue deletion 

nor insertion. d, SDS-PAGE of purified T4-7am particles carrying the 
wild-type (WT) and five mutant gp7 proteins after their concentrations 
were brought to a common scale according to their absorbance at 260 nm 
(the full gel is shown in Supplementary Fig. 1). e, Infectivity of the mutant 
particles shown in d. The error bar indicates the 95% confidence interval 
obtained in three independent experiments (n = 3). Only one mutation 
was statistically significantly different from the rest and P-values 
(two-tailed Student's t-test) comparing it to the wild-type phage and 

to the mutant with the largest experimental error are given. 
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Transcription initiation complex 
structures elucidate DNA opening 


C. Plaschka!*, M. Hantsche!*, C. Dienemann!, C. Burzinski!, J. Plitzko? & P. Cramer! 


Transcription of eukaryotic protein-coding genes begins with assembly of the RNA polymerase (Pol) II initiation complex 
and promoter DNA opening. Here we report cryo- electron microscopy (cryo-EM) structures of yeast initiation complexes 
containing closed and open DNA at resolutions of 8.8 A and 3.6 A, respectively. DNA is positioned and retained over the 
Pol II cleft by a network of interactions between the TATA -box-binding protein TBP and transcription factors TFIIA, TFIIB, 
TFIIE, and TFIIF. DNA opening occurs around the tip of the Pol II clamp and the TFIIE ‘extended winged helix’ domain, 
and can occur in the absence of TFITH. Loading of the DNA template strand into the active centre may be facilitated by 
movements of obstructing protein elements triggered by allosteric binding of the TFIIE ‘E-ribbon’ domain. The results 
suggest a unified model for transcription initiation with a key event, the trapping of open promoter DNA by extended 


protein-protein and protein-DNA contacts. 


For transcription initiation, Pol II assembles with the basal transcrip- 
tion factors (TF) IIB, TFIID (or its subunit TBP), TFIIE, TFIE and 
TFIIH'“ on double-stranded promoter DNA to form the closed com- 
plex (CC). Upon DNA opening, the template strand slips into the Pol 
II active centre, and a DNA ‘bubble’ forms ~20-30 base pairs (bp) 
downstream of the TATA box”, leading to the open complex (OC). 
Efficient DNA opening requires TFIIE and TFIIH® ®, but these factors 
are not required for low levels of transcription”? and TFIIE alone can 
open certain promoters'!!”, Subsequent RNA synthesis results in the 
initially transcribing complex (ITC), which is then converted into an 
elongation complex for processive RNA synthesis. 

The three-dimensional architecture of Pol II initiation complex 
intermediates was studied in yeast!*~’? and human”’ systems. These 
studies revealed that the promoter assembly containing TBP and TFIIB 
resides over the Pol II wall and positions DNA above the polymerase 
cleft and along the clamp. TFIIE and TFIIF bind to opposite sides of 
Pol II and flank promoter DNA. Owing to the limited resolution of 
available structural studies, many questions regarding the molecular 
basis of initiation remain, including the mechanisms of DNA opening 
and template-strand loading into the active centre. A differing model 
of the yeast initiation complex’ was revised recently”’, and now agrees, 
on a topological level, with other studies and results reported here. 

Here we report cryo-EM structures of CC and OC assemblies from 
the yeast Saccharomyces cerevisiae at resolutions of 8.8 A and 3.6 A, 
respectively. The structures contain all of the basal transcription fac- 
tors except TFIIH, which is currently not available at a quality required 
for high-resolution analysis. We show that DNA opening can occur 
in the absence of TFIIH, and provide mechanistic insights into DNA 
opening and template-strand loading. Our results also unveil the exact 
location and intricate structure of basal factors and their induced fold- 
ing interactions with each other, with Pol II, and with promoter DNA. 
The data also demonstrate the high structural conservation between 
yeast and human initiation systems. 


Transcription initiation complex at 3.6 A 
We extended our previous cryo-EM analysis of a Pol II core initiation 
complex containing TBP, TFIIB, and TFIIF’® by adding TFIIA and 


TFHE (Methods). Formation of a stable and stoichiometric complex 
required the presence of core Mediator, which showed low occupancy 
and high flexibility under cryo-EM conditions, as observed previously’, 
and was excluded from further analysis (Methods, Extended Data 
Fig. la-c). We acquired 257,259 cryo-EM single particle images using 
a K2 direct electron detector (Methods, Extended Data Fig. 1b, c). 
Unsupervised particle sorting led to an OC structure at an overall res- 
olution of 3.6 A, revealing the Pol II core at up to 3.1 A (OC1), compared 
to 7.8 A for our core ITC structure’? (Fig. 1a, b, Extended Data Figs 
1d-g). Further particle sorting revealed improved density for TFIIB 
and TFIIF at around 4A resolution (OC2 and OC4), and for TFIE 
at 4.4A resolution (OC3, Methods, Extended Data Figs 1i, k, 1). The 
final structure mainly consists of atomic models (90%) and contains 
backbone models for parts of the basal factors (Extended Data Fig. 1f-1). 
The structure reveals upstream DNA above the Pol IT wall and down- 
stream DNA in the active centre cleft'®. We observed only fragmented 
density for single strands within the DNA bubble (Fig. 1b, c, Extended 
Data Fig. 1m), and no density for RNA, which was not retained dur- 
ing cryo-EM sample preparation (not shown). The bubble contains 
15 mismatches, begins at the natural distance of ~20 bp downstream 
of the TATA box’, and extends further downstream than when it ini- 
tially forms, resembling the situation during transcription start-site 
scanning. For the basal factors, regions that are essential for cell via- 
bility are generally observed, whereas non-essential, non-conserved 
regions’©**-*° are often mobile (Fig. 1a, b, Extended Data Fig. 1 j-l). 
TFIIA is located near upstream DNA and TBP as expected’? (Fig. 1b, 
Extended Data Fig. 1j). For TFIIB, the B-ribbon and B-core domains 
are well defined, including the newly modelled carboxy-terminal 
cyclin domain from yeast (Fig. 2), whereas the B-linker shows weak 
density, and the B-reader'»”'” is mobile (Extended Data Fig. 1i). 


DNA positioning and retention 

TFIIF adopts an intricate fold within the OC. Its dimerization mod- 
ule and charged helix are located on the Pol II lobe domain as in the 
ITC!®!9 (Extended Data Fig. 2). The ‘arm in the large TFIIF subunit 
Tfg1 (human RAP74) adds a 8-strand to the Pol II protrusion, and 
projects into the cleft, where it may stabilize the DNA bubble'®>”” 


1Max Planck Institute for Biophysical Chemistry, Department of Molecular Biology, Am Fassberg 11, 37077 Gottingen, Germany. @Max Planck Institute for Biochemistry, Am Klopferspitz 18, 


82152 Martinsried, Germany. 
*These authors contributed equally to this work. 
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Figure 1 | Open complex structure at 3.6 A resolution. a, Domain 
organization of yeast basal transcription factors TBP, TFIJA, TFIIB, TFIIE, 
and TFIIF. Solid and dashed black bars indicate protein regions that are 
present in the OC structure as atomic and backbone models, respectively. 
Colour code used throughout. b, Two views” of the yeast OC structure. 
Pol I is in silver. DNA template and non-template strands are in dark 
blue and cyan, respectively. On the right, Pol II is shown as a surface 
representation, all other proteins are shown as ribbon models. c, Protein- 


(Fig. 2). The linker in TFIIF subunit Tfg2 (human RAP30) emanates 
from the dimerization module, and winds along the base of the 
protrusion, where it binds a hydrophobic pocket. The Tfg2 linker 
continues between the protrusion and the TFIIB cyclin domains, and 
connects to the Tfg2 C-terminal winged-helix (WH) domain on top 
of the cleft (Fig. 2, Extended Data Fig. 2f). The Tfg2 linker stabilizes 
TFIIB on the wall of Pol II?”?%. The yeast-specific amino-terminal 
region of Tfg1 binds near the Pol II external 1 domain, according to a 
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DNA contacts. Promoter DNA nucleotides are depicted with solid, shaded, 
and empty circles when they were included in the structure, excluded 
owing to weak density, or excluded owing to a lack of density, respectively. 
Solid and dashed lines indicate observed and putative protein interactions, 
respectively. A magenta dashed line indicates the contact between closed 
DNA and the TFIIE E-wing. The register of promoter DNA is given for 
analogous yeast (black) and human (grey) positions with respect to the 
transcription start site (TSS, +1). TATA box is indicated by a grey box. 


separate crystallographic analysis (Extended Data Fig. 2d, e, Extended 
Data Fig. 9c). 

TFHE is located between the clamp and the Rpb4—Rpb7 stalk (Fig. 3a, b, 
Methods, and Extended Data Fig. 3), consistent with previous top- 
ological placement of TFIIE'®”°??*! and its archaeal counterpart”. 
The TFIIE structure differs from a recent model obtained at 6 A 
resolution”! (see Extended Data Fig. le legend for details, Extended 
Data Fig. 3, and refs 16, 17). The large TFIIE subunit Tfal (human 
TFIIEq) contains an extended winged helix (eWH’) domain*’ and 
a zinc ribbon domain™ (‘E-ribbor’) that are connected by a-helices 
(called here ‘E-linker’) (Fig. 3b). The eWH domain uses its helix «3 to 
contact both the tip of the Pol II clamp helices and the DNA backbone 
at positions —13/—14 upstream of the transcription start site (TSS, 
position +1) (Fig. 1c, 3a). The E-ribbon binds between the clamp, 
the Rpb7 oligonucleotide-binding (OB) domain, and the B-ribbon 
(Fig. 3a). The E-linker?°?! and the mobile C-terminal domain 
of Tfal contact TFITH”*°, which may alter TFIIE conformation. 
The small TFIJE subunit Tfa2 (human TFIIE8) contains two WH 
domains (“‘WH1’ (ref. 36), “WH2’), and two conserved a-helices 
(called here ‘E-tether’) that bind the E-linker (Fig. 3b). Consistent 
with the structure, the E-tether is essential for TFIIE function'®*” 


Figure 2 | Basal factors position and retain DNA. Details of the upstream 
DNA assembly viewed from the side*®. Highlighted are the locations 

of the Pol II wall (navy blue), protrusion (orange), TBP (red), TFIIB 
(green), TFIIF arm (purple), Tfg2 linker (dark magenta) and winged 
helix (light purple), TFIIE Tfa2 WH1 (light salmon), DNA (template, 
blue; non-template, cyan), and the active site (magenta sphere). TFIIA 
is transparent (light yellow). Cryo-EM densities for newly modelled 
regions of TFIIB, TFIIF, and TFIIE are superimposed on their structural 
models. Interactions with the Pol II protrusion and upstream edge of 
the DNA bubble are indicated. TBP contacts a density assigned to the 
Tfg2 C-terminal region, consistent with interaction of their human 
counterparts~’ (Extended Data Fig. 2f). 
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Figure 3 | TFIIE architecture and interactions. a, TFIIE interactions 
within the OC. Depicted are interactions of the TFHE E-ribbon with the 
Pol II clamp, stalk subunit Rpb7, and the TFIIB B-ribbon, and interactions 
of the TFITE eWH domain with the Pol II clamp helices and upstream 
DNA. The eWH E-wing lies close to the upstream DNA edge, similar to 
WH domains involved in DNA strand separation (Extended Data Fig. 3)). 
Colours as in Fig. 1, except for the Pol II stalk (Rpb4, dark red; Rpb7, 

dark blue). b, TFIIE domain architecture. The TFIIE variants used for 
functional assays are indicated as Ca spheres for point mutations, and with 
a black bracket for E-wing alterations (compare Extended Data Fig. 3g, h, j). 
Connectivity of the Tfa2 E-tether helices is uncertain. c, Selected TFIIE 


and subunit dimerization (Extended Data Fig. 3a, refs 16, 38). The 
structure further indicates that TFIIE must be displaced or at least 
moved before the elongation factor Spt4/5 (human DSIF) can bind 
to polymerase*”’, 

The OC structure thus reveals how TFIIF and TFIIE bind open pro- 
moter DNA from opposite sides of the Pol II cleft. First, TFIF adopts 
an extended induced structure that allows it to retain the upstream 
DNA-TBP-TFIIB assembly on the wall and to bind the DNA bubble 
and downstream duplex in the cleft (Figs 1b, 2). Second, the TFIIF 
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Figure 4 | Closed complex cryo-EM structure. a, Details of the closed 
complex viewed from the front°’. Highlighted are TFIIE, TFIIF Tfg2 WH, 
and DNA, superimposed on their density. The promoter DNA displays 
increased flexibility downstream of the E-wing contact at position —7 
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variants impair transcription from a HIS4 promoter (Methods, Extended 
Data Fig. 3g, h). TFIIE-depleted nuclear extract (NE) was reconstituted 
with recombinant TFHE or TFIE variants carrying mutations in the Tfal 
eWH (M1, Tfal(N50E/K51E/T52E); M2, Tfal(N50A/K51A/T52A); M3, 
Tfal(P56A/A59E/R62E); M4, Tfal(AE-wing); M5, Tfal(poly-Ala E-wing) 
and the Tfal E-ribbon (M6, Tfal(L134E/V137E/L140E); M7, Tfal(L134A/ 
V137A/L140A) (Extended Data Fig. 3g). RNA products were visualized 
by primer extension and the mean intensity and standard deviation (s.d.) 
from triplicate experiments are provided, relative (rel.) to the activity 

of wild-type TFIIE. An asterisk marks RNA products resulting from an 
alternative upstream transcription start site. 


Tfg2 WH domain and the TFIIE Tfa2 WH1 domain contact each 
other above upstream DNA to encircle and retain DNA. Third, the 
eWH domain of TFHE binds DNA in the region of initial DNA open- 
ing and its long 81-82 hairpin® (called here the ‘E-wing’) projects to 
the upstream edge of the bubble (Fig. 3a), suggesting that the eWH 
domain stabilizes open DNA. Taken together, the highly modular and 
flexible basal factors TFIIF and TFIIE undergo substantial induced 
folding transitions to engage in multiple protein-DNA and protein- 
protein interactions to stabilize the OC. 
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upstream of the TSS (+1). b, Different positions of the TFIIE eWH in 
closed (dark magenta) and open (light magenta) complexes, viewed from 
the top*°. Compare Extended Data Fig. 4e-g. 
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Figure 5 | Cleft clearance and DNA template loading. a, OC structure 
viewed from the top>’. Highlighted are the Pol II active site (magenta 
sphere), fork loop 1 (yellow), lid (dark red), rudder (magenta), wall 
(navy blue), dock (brown), zipper (dark green), TFIIB B-ribbon (green), 
TFIE E-ribbon (magenta) and downstream DNA (template, dark blue; 
non-template, cyan). The template single-strand was modelled using the 
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fork loop 1, lid, and rudder in the OC (left), and in a Pol II-TFIIB ITC” 
(PDB: 4BBS, right). Movement of the Pol II lid (left, black to dark red) leads 
to a steric clash with the B-reader (cyan). Compare Extended Data Fig. 5a-c. 
c, Allosteric binding of the TFIIE E-ribbon may lead to an altered position 
of the TFIIB B-ribbon. Movements in Pol II wall and flap loop (navy blue), 
dock (brown), zipper (dark green), and B-ribbon (green) are observed 


in presence of TFIIE compared to the crystal structures of the binary 

Pol II-TFIIB complex’? (dark grey, PDB: 3K1F) and Pol II-TFIIB ITC’® 
(light grey, PDB: 4BBS). The altered B-ribbon position may be stabilized by 
binding to a short helix formed in loop 812-613 of the dock domain. 


Pol II-TFIIB ITC” crystal structure. b, Fork loop 1 and lid assume new 
positions in the OC compared to the ITC’? and this opens a path (arrow) 
for loading of the template DNA strand (blue) into the active site (magenta 
sphere). Surface representations of Pol II cleft (silver), and cleft elements 


for complex preparation. DNA opening occurred in the absence of 
TFIIH (Extended Data Fig. 4h, i). From these particles we obtained an 
independent reconstruction of the spontaneously formed OC at 6.1 A 
resolution (OC5, Extended Data Fig. 4h). Weak density for upstream 
and downstream DNA segments indicates that DNA bubbles of var- 
ious sizes formed during DNA opening (Extended Data Fig. 4i). The 
reconstruction resembles the high-resolution OC structure, suggest- 
ing that the latter was not perturbed by the use of pre-opened DNA 
(Extended Data Fig. 4h). 


DNA opening and loading 

Modelling of a closed DNA promoter onto the OC structure shows 
that closed DNA would clash with the TFIIE eWH domain. This sug- 
gests that the eWH domain adopts a different position before DNA 
opening. To investigate this, and to provide insights into the transi- 
tion from the CC to the OC, we repeated structure determination 
with closed DNA instead of pre-opened DNA (Methods, Extended 
Data Fig. 4). Surprisingly, cryo-EM analysis revealed that about 3 out 
of 4 particles contained open DNA, although closed DNA was used 
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and adjusting the clamp to the position observed in the human CC. Shown 
are cryo-EM densities for DNA, Tfg2 WH, and TFIIE. DNA positions 
—10, —7 (yeast CC) and +1 (TSS) are labelled. DNA was extended by one 
turn for the yeast CC (black bracket). The locations of TFIHA and TFIIE 

in the ITC were inferred from the yeast OC. Obstructing Pol II and TFIIF 
regions were removed for clarity. b, Schematic representation of a. Key 
elements for DNA opening are indicated (compare Extended Data Fig. 6). 


Figure 6 | Model for DNA opening during transcription initiation. 

a, Gallery of initiation complexes depicting proposed movements (arrows) 
of DNA and basal factors during the transition from the CC to OC to ITC, 
from left to right, viewed from the side*’. Yeast CC and OC structures 
(this work) were complemented with our previous yeast ITC”’ structure 
(EMD-2785) and an alternative model of the CC (human CC’), which was 
obtained by replacing the DNA with that in the human CC”? (EMD-2306), 
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From the remaining particle images we obtained a reconstruction 
of the CC at 8.8 A resolution (F ig. 4a). Comparison of this CC recon- 
struction with the OC structure reveals movements mainly in TFITE 
(Fig. 4b, Extended Data Fig. 4e-g). In the CC, the E-wing lies on top 
of the DNA around position —7, in the region where DNA opening 
begins’ (Extended Data Fig. 4e). Consistent with this contact, Tfal (or 
the human counterpart TFHEq) crosslinks to DNA near this point in 
the CC**4! (Extended Data Fig. 4e). Conversion of the CC to the 
OC involves movement of upstream DNA and the DNA-associated 
domains Tfa2 WH1 and Tfg2 WH towards the cleft. DNA opening 
allows the TFIE eWH domain to bind the tip of the clamp, and ena- 
bles the E-wing to move near the upstream edge of the DNA bubble. 
Consistent with these changes, crosslinks between upstream DNA and 
the large TFIIE subunit are altered when the CC is converted to the 
OC”. These observations indicate that DNA opening involves TFIIE, 
and in particular the eWH domain. 

TFIIE may also be involved in loading of the DNA template strand 
into the active centre during the transition from the CC to the OC. In 
previous structures, the path for loading the template was obstructed 
by the TFIIB B-reader and the Pol II fork loop 1 and lid. However, 
the B-reader is mobile in the OC structure, and fork loop 1 and the 
lid are moved to provide a path for template-strand loading (Fig. 5b, 
Extended Data Fig. 5a—c). These movements are apparently trig- 
gered by TFIIE because binding of the E-ribbon leads to a shift in the 
B-ribbon that partially withdraws the B-reader from the cleft (Fig. 5c). 
Thus allosteric binding of the E-ribbon apparently induces ‘clearance’ 
of the Pol II cleft that may facilitate template-strand loading into the 
active centre, and transcription start-site scanning in yeast. 

To support the proposed functions of structural elements in TFIIE, 
we prepared recombinant TFIE variants and tested them for binding 
to the CC and for promoter-dependent transcription activity in yeast 
nuclear extract (Methods, Fig. 3c, Extended Data Fig. 3g—i). Mutation 
of only three surface residues in the E-ribbon that contact the Pol II 
subunit Rpb7 strongly impaired both binding to the CC and tran- 
scription activity, and led to a severe growth defect in yeast (Fig. 3c, 
Extended Data Fig. 3h, j). Further, the eWH domain is required for 
TFIIE function’®, and disruption of the eWH contact with DNA by 
introducing negatively charged glutamate residues in the eWH helix 03 
leads to a transcription defect and impairs binding to the CC (Fig. 3c, 
Extended Data Fig. 3h). Other mutations in the eWH domain did not 
show functional defects in our assays. Deletion of the E-wing results in 
a mild growth phenotype (Extended Data Fig. 3j), and does not impair 
in vitro transcription (Fig. 3c), maybe because TFIIH compensates for 
the loss of E-wing function in these assays. 


Model of transcription initiation 

Structural comparisons of the yeast CC, OC, and ITC’’ with each other 
and with the highly conserved human CC”? reveal differences in the 
positions of DNA, the clamp, and TFIIE, and lead to an extended model 
of DNA opening (Fig. 6a, b, Extended Data Fig. 6). In this model, pro- 
moter DNA is initially bent away from the active site by ~20° near 
position —10 at the tip of the clamp helices (yeast CC). The clamp 
then opens slightly (Extended Data Fig. 6b), allowing promoter DNA 
to bend in the opposite direction and to enter the upper part of the cleft 
(human CC’). This frees the site at the tip of the clamp helices that can 
now bind the TFIIE eWH domain. DNA can then no longer swing 
back to its initial position, because this would result in a steric clash 
with the repositioned eWH domain. However, when DNA opening 
occurs around position —10, the upstream DNA can swing back to 
its original position and this stabilizes the DNA duplex single-strand 
junction. TFIIH then rotates downstream DNA and pushes the 
template single strand into the cleft using its ATP-dependent trans- 
locase activity, as suggested previously'®. When the bubble extends 
downstream”, the template strand is loaded into the active centre cleft 
via a path cleared by allosteric binding of TFHE (yeast OC). Template- 
strand loading allows the clamp to close again and to trap downstream 
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DNA in the cleft (Extended Data Fig. 6b). The B-reader then covers 
the template strand, and helps to detect the transcription start site!’, 
triggering RNA synthesis (yeast ITC). 

This model explains how DNA opening can be achieved with the use 
of binding energy alone, at least at some promoters!»*. DNA open- 
ing allows for new protein interactions at the upstream DNA duplex 
single-strand junction involving the eWH domain, the TFIIF arm, 
and the Pol II clamp. DNA loading into the cleft also enables new 
interactions of the downstream DNA duplex with the TFIIF charged 
helix and the Pol II cleft and clamp. These additional contacts may 
compensate for the energy needed for DNA melting and help to trap 
open DNA and to prevent its re-closure. 

A similar mechanism for ATP-independent DNA opening may 
be used in other transcription systems. Proteins with homologies 
to TFIIB, TFIE, and TFUF, but not TFIIH, are present in the Pol I 
and Pol III systems’, and counterparts of TBP, TFIIB, and TFIIE are 
found in archaea*®. The bacterial transcription initiation system is 
structurally unrelated, but conceptually similar. DNA opening occurs 
spontaneously above the cleft, the open DNA is trapped with the use 
of binding energy, and this requires clamp opening and closure**”’. 
DNA opening at a subset of bacterial genes however requires the ATP- 
dependent initiation factor o**, and this enables further regulation®®. 
Similarly, Pol II regulation can occur at the level of DNA opening”, 
and TFIIH is required to keep DNA open at least at selected promot- 
ers®. This suggests that the Pol II initiation system evolved to depend 
on the ATP-consuming factor TFIIH for DNA opening, probably in 
response to an expanded need for gene regulation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 

Vectors and sequences. The open reading frame (ORF) encoding full-length TBP 
was amplified from Saccharomyces cerevisiae (Sc) genomic DNA and cloned into 
a pOPINE vector, containing a C-terminal 6 x histidine tag. Codon-optimized 
Tfg1, Toal, and Toa2 were commercially obtained for expression in Escherichia 
coli (E. coli, Life Technologies). Toal (A95-209) was obtained by quick-change 
PCR and cloned with Toa2 into a pOPINE vector, containing a C-terminal 
6 xhistidine-tag on Toa2. Tfal and Tfa2 ORFs were amplified from genomic Sc 
DNA and cloned sequentially into a pET vector. ORFs for Tfg1 and Tfg2 were cloned 
sequentially into a modified pET-Duet-1 vector, containing an N-terminal 10x 
histidine-8 x arginine-SUMO-tag on Tfg1. Additional ribosomal binding sites were 
introduced as described?’ For heterologous co-expression of Sc 16-subunit core 
Mediator (cMed)-Med1, 13-subunit cMed lacking Med4—Med9 was cloned with 
previously described vectors'’. Three-subunit Med1-Med4—Med9 was prepared 
by cloning Sc ORFs of Med1, Med4, and Med?9 into a modified pET-Duet-1 vector, 
containing an additional N-terminal 10 x histidine-8 x arginine-SUMO-tag on 
Med1. Sequences are available upon request. 

Recombinant proteins. All proteins were expressed in E. coli BL21(DE3)RIL cells 
(Stratagene). The identity of all purified proteins was confirmed by mass spectrom- 
etry. All purified proteins and complexes were flash-frozen and stored at —80°C. 

Full-length TBP was expressed in E. coli that were grown in lysogeny broth (LB) 
medium at 37°C to an optical density (OD) of 0.5 at 600 nm, and induced with 
0.5 mM isopropyl-B-p-thiogalactoside (IPTG) for 4h at 20°C. Cells were lysed 
by sonication in buffer A (25mM HEPES (pH 7.5), 500mM KCI, 10% glycerol, 
2.5mM dithiothreitol (DTT)) containing protease inhibitors°’. The soluble frac- 
tion was applied to a 5 ml Histrap HP column, washed with buffer A containing 
1M KCl, and eluted with buffer A containing 350 mM imidazole. The sample was 
diluted 1:4 with buffer A lacking KCl and applied to anion-exchange chroma- 
tography using a MonoS 5/50 (GE Healthcare), and eluted with a linear gradient 
of buffer A from 100-1000mM KCI. TBP was further purified by size-exclusion 
chromatography using a Superose 12 10/300 (GE Healthcare) column, equilibrated 
in buffer B (10 mM HEPES (pH 7), 200 mM NaCl, 5% glycerol, 2mM DTT). TBP- 
containing fractions were pooled and concentrated to 8 mg ml7!. 

TFIIA was obtained by co-expression of its subunits Toal and Toa2 in E. coli. 
Transformed cells were grown in LB medium at 37°C to an OD of 0.5 at 600 nm. 
Expression was induced with 0.5mM IPTG at 37°C for 4h. Cells were lysed in 
buffer C (25 mM Tris-HCl (pH 8.0), 500 mM NaCl, 10% glycerol, 2mM DTT) 
containing protease inhibitors”, and cleared by centrifugation. The supernatant 
was applied to a 5-ml HisTrap HP column, washed with buffer C containing 1M 
NaCl, and eluted with buffer C containing 250 mM imidazole. Fractions contain- 
ing the complex were pooled, diluted 1:4 with buffer C lacking NaCl, loaded on a 
MonoQ 5/50 anion-exchange column, and eluted with a linear gradient of buffer C 
from 100-500 mM NaCl. TFIIA containing fractions were pooled and applied to a 
Superdex 75 10/300 column (GE Healthcare), in buffer B. TFIIA was concentrated 
to 8mgml !. TFIIB was prepared as described". 

Recombinant TFIIE was obtained by co-expression of its subunits Tfal and Tfa2 
in E. coli. Cells were transformed and grown in LB medium at 37°C to an OD of 0.6 
at 600 nm, and expression was induced by 0.5mM IPTG for 18h at 18°C. Cells were 
lysed by sonication in buffer D (50 mM Tris-HCl (pH 8.0), 300 mM NaCl, 0.02% 
Tween-20, 5mM DTT) containing protease inhibitors’. The lysate was cleared by 
centrifugation and applied to a 5 ml Histrap HP column, equilibrated in buffer E 
(buffer M, lacking Tween-20, containing 10 mM imidazole). The column was 
washed with 10 column volumes of buffer E and eluted with buffer E containing 
250 mM imidazole. TFIIE was then subjected to anion-exchange chromatography 
using a 5-ml Hi-Trap Heparin column (GE Healthcare), equilibrated in buffer F 
(50 mM Tris-HCl (pH 8.0), 100 mM NaCl, 2% glycerol, 5mM DTT). The com- 
plex was eluted with a linear gradient of buffer F from 100-2000 mM NaCl. To 
improve purity, TFIIE was further applied to a Superose 12 10/300 size-exclusion 
column, in buffer G (5mM HEPES (pH 7.25), 40 mM ammonium sulphate, 10 1.M 
ZnCl,, 10mM DTT). TFIE containing fractions were pooled and concentrated 
to9.6mgml". 

Sc TFIIF subunits Tfg1 and Tfg2 were co-expressed in E. coli and cells were 
grown in LB medium at 37°C to an OD of 0.8 at 600 nm. Expression was induced 
with 0.2 mM IPTG for 3 h at 37°C. Cells were lysed by sonication in buffer H 
(50mM HEPES (pH 7.0), 350mM KCl, 10% glycerol, 2mM DTT) supplemented 
with 50 mM imidazole and protease inhibitors°”. Cleared lysate was applied to a 
5-ml HisTrap HP column equilibrated in buffer H. The column was washed with 
8 column volumes of buffer H containing 1000 mM KCI, and eluted with a linear 
gradient from buffer H to buffer I (50mM HEPES (pH 7.0), 250 mM KCl, 800 mM 
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imidazole, 10% glycerol, 2mM DTT). The conductivity of the eluate was adjusted 
to match that of buffer J (50 mM HEPES (pH 7.0), 150mM KCl, 10% glycerol, 
2mM DTT) and 3C protease cleavage was carried out for 2h. The complex was 
then applied to cation-exchange chromatography using a 1-ml HiTrap SP HP 
column (GE Healthcare), equilibrated in buffer J, and eluted in a linear gradient 
from 150-1000 mM KCI. TFIIF was further purified by size-exclusion chromatog- 
raphy using a Superdex 200 10/300 Increase column (GE Healthcare), in buffer K 
(10mM MES (pH 6.2), 150 mM KCl, 10% glycerol, 2mM DTT). Purified TFIIF 
was concentrated to 3.9 mg ml !. For previous studies of the initiation complex we 
used the conserved S. mikatae Tfg1 (refs 18, 19), owing to difficulties in recombi- 
nant expression of its Sc homologue. Here, expression of Sc Tfg1 was enabled by 
codon optimization. 

For preparation of recombinant 16-subunit CMed—Med1, 13-subunit cMed 
lacking Med4—Med9 was prepared essentially as described!’. Separately, a 
plasmid containing Med1-Med4-Med9 was transformed in E. coli and cells 
were grown in LB medium at 37°C to an OD of 0.6 at 600 nm. Protein expres- 
sion was induced with 0.5mM IPTG at 18°C for 24h. Cells were collected and 
lysed by sonication in buffer L (25mM HEPES (pH 7.5), 400mM potassium 
acetate, 10% glycerol, 20mM imidazole (pH 8), 2mM DTT) containing pro- 
tease inhibitors*”. Lysate was cleared by centrifugation and applied to a 5-ml 
Histrap HP column (GE Healthcare), equilibrated in buffer M (25 mM HEPES 
(pH 7.5), 400 mM potassium acetate, 10% glycerol, 30 mM imidazole (pH 8), 
2mM DTT). The column was washed with 5 column volumes of buffer M and 
a linear gradient over 10 column volumes from 30-100 mM imidazole. The 
heterotrimer was eluted with 300 mM imidazole over 10 column volumes. 
Fractions containing the complex were diluted 1:3 with buffer N (25mM HEPES 
(pH 7.5), 100 mM KCl, 10% glycerol, 1 mM EDTA, 5mM DTT) and incubated 
with 3C protease for 2h on ice, to remove the affinity tag. The complex was 
further purified by anion-exchange chromatography using a MonoQ 5/50 GL 
column (GE Healthcare), equilibrated in buffer N, and was eluted with a linear 
gradient from 100-600 mM KC] over 150 column volumes. Fractions containing 
the complex were pooled and applied to size-exclusion chromatography using 
a Superose 6 10/600 column (GE Healthcare), equilibrated in buffer O (25mM 
HEPES (pH 7.5), 200mM KCl, 5mM DTT). Purified Med1-Med4—Med9 complex 
was concentrated to 2.7 mg ml"'. To prepare 16-subunit cCMed—Med1, 13-subunit 
cMed was incubated with a twofold molar excess of Med1-Med4—Med9 for 
30 min at 25°C and 30 min on ice, and purified by size-exclusion using a Superose 
6 10/600 column, in buffer P (25 mM HEPES (pH 7.5), 400 mM KCl, 5% glycerol, 
5mM DTT). Fractions containing cMed—Med1 were pooled and concentrated 
to2mgml]"!, 

Preparation of initiation complexes. Yeast 12-subunit Pol II was prepared as 
described**. Open and closed initiation complexes were prepared with several 
modifications of the previous cITC-cMed assembly scheme” and with a different 
nucleic acid scaffold for the closed complex. The 72 nucleotide nucleic acid scaffold 
previously used to prepare the Pol II core ITC (cITC)!” contains a 15 nucleotide 
mismatch transcription bubble and six nucleotide RNA, and was used for assem- 
bly of the open initiation complex (OC). The closed complex (CC) contained a 
nucleic acid scaffold with 13 nucleotide longer downstream DNA, based 
on the HIS4 promoter (template, 5‘-TGATATTTTTATGTATGTACA 
ACACACATCGGAGGTGAATCGAACGTTCCATAGCTAT TATATACACAGCG 
TGCTACTGTTCTCG-3’; non-template, 5’-CGAGAACAGTAGCACGCTGTGT 
ATATAATAGCTATGGAACGTTCGATTCACCTCCGATGTGTGTTGTACATA 
CATAAAAATATCA-3’). The initiation complex was prepared as follows. Pol II 
(200 1g at 3mg ml) was incubated with a fourfold molar excess of TFIIF. A two- 
fold molar excess of nucleic acid scaffold over Pol II, tenfold molar excess of TFA, 
fourfold molar excess of TBP and TFIIB were added to buffer Q (25 mM HEPES- 
KOH (pH 7.5), 150mM potassium acetate, 5% glycerol, 2mM MgCl, 5mM DTT) 
and incubated with pre-formed Pol II-TFIIF complex for 8 min at 25°C. TFIIE 
was added in a tenfold molar excess over Pol II and incubated for 5 min at 25°C. 
cMed-Med1 was added in a 1.2-fold molar excess over Pol II and incubated for 
50 min at 25°C. Open and closed complexes were purified using a Superose 6 
3.2/300 size exclusion column (GE Healthcare), equilibrated in buffer Q. Fractions 
containing the complex were pooled (0.4—0.8 mg ml") and additionally incubated 
with equimolar amounts of nucleic acid scaffold. The sample was then crosslinked 
for 30 min on ice using 0.1% glutaraldehyde (Electron Microscopy Sciences), and 
the reaction was quenched with 50 mM lysine (Sigma). The crosslinked sample 
was re-purified in a second size-exclusion step using a Superose 6 3.2/300 column, 
equilibrated in buffer Q lacking glycerol. Fractions containing initiation complexes 
were pooled (0.2-0.6 mg ml) and used for EM grid preparation. 

Electron microscopy. Initiation complex samples were applied to R3.5/1 holey 
carbon grids (Quantifoil). Grids were glow-discharged for 15s before deposi- 
tion of 4.511 complex, and subsequently blotted and vitrified by plunging into 
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liquid ethane with a Vitrobot Mark IV (FEI) operated at 4°C and 100% humidity. 
Cryo-EM data was acquired on a FEI Titan Krios operated in EFTEM mode at 
300 keV, and equipped with a K2 Summit direct detector (Gatan). Automated data 
collection was carried out using the TOM toolbox* to acquire 1756 movies of 
the OC with a range of defocus values (from —0.7 1m to —4.2 um) at a nominal 
magnification of 37,000 (1.35A per pixel). The camera was operated in 
‘super-resolution’ mode (0.675 A per pixel), with a total exposure time of 10s 
fractionated into 25 frames, a dose rate of ~5 e~ per pixel per second, and total 
dose of 33 e~ A~? per movie. Cryo-EM data of the CC was collected in the same 
manner, except that 959 movies were acquired with a defocus range from —0.8 ym 
to —5.m, a total exposure time of 6s with 20 frames, a dose rate of ~8 e~ per pixel 
per second, and total dose of 40 e~ A~?. Movies were aligned as described!*®, 
except that images were not partitioned into quadrants. 

Image processing. For single particle analysis of the OC, an initial set of 10,225 
particles was selected semi-automatically using e2boxer.py from EMAN2 (ref. 56). 
CTF parameters were estimated using CTFFIND4 (ref. 57). CTF correction and 
subsequent image processing was performed with RELION 1.3 (ref. 58), unless 
otherwise noted. Resolution was reported on the basis of the gold-standard Fourier 
shell correlation (FSC) (0.143 criterion) as described previously”? and tempera- 
ture factors were determined and applied automatically in RELION”®. Selected 
particles were extracted with a 3007 pixel box and pre-processed to normalize 
images. Reference-free two-dimensional (2D) class averages were calculated, and 
twelve representative classes were low-pass filtered to 25 A resolution and used 
as templates for automated picking™ of all micrographs. The resulting 415,030 
particle images were screened manually and by reference-free 2D classification, 
yielding 257,259 particle images that were used for subsequent processing. The 
7.8 A cryo-EM map of the yeast cITC!” (EMD-2785) was low-pass filtered to 50 A 
and used as initial model for 3D refinement of the 10,225 particle set. This revealed 
an OC density to an estimated resolution of 10 A. This density was low-pass filtered 
to 50A and used for processing of the complete OC cryo-EM data set (Extended 
Data Fig. 7a—d). A 3D reconstruction of all particles was calculated to 3.75 A, and 
subjected to particle polishing using RELION 1.4 beta”®. This lead to an improved 
density at 3.58 A resolution. 

Hierarchical 3D classification was carried out without image alignment, to 
reduce computational requirements and identify homogeneous single particle 
groups (Extended Data Fig. 7a—d). Soft masks encompassing the complete OC or 
smaller regions of Pol II and basal factors were generated using the volume eraser 
in UCSF Chimera‘! and RELION™. This included masks for Pol II stalk-TFIIE 
E-ribbon (Extended Data Fig. 7a, c), upstream DNA-TBP-TFIIA-TFIIB-Tfg2 
linker (Extended Data Fig. 7b), TFITE-Tfg2 WH (Extended Data Fig. 7c), TFIIF 
(Extended Data Fig. 7d), and TFIIF dimerization domain (Extended Data Fig. 7d). 
Each class was refined using the 3D auto-refine procedure against the respec- 
tive particles within that class with a soft reference mask in the shape of the OC 
(maximum diameter of 270 A), generated in RELION**. The OC1 reconstruction 
(improved Pol II core, B-ribbon, and E-ribbon density) was determined from 
102,876 particles to a resolution of 3.58 A and with a temperature factor of —111 A? 
(Extended Data Fig. 8a). The OC2 reconstruction (improved upstream DNA, TBP, 
TFIIA, TFIIB, and Tfg2 linker density) was determined from 17,282 particles to 
a resolution of 3.97 A and with a temperature factor of —95 A? (Extended Data 
Fig. 8b). The OC3 reconstruction (improved TFIE density) was determined from 
11,231 particles to a resolution of 4.35 A and with a temperature factor of —125 A? 
(Extended Data Fig. 8c). The OC4 reconstruction (improved TFIIF dimerization 
and Tfg1 arm density) was determined from 29,455 particles to a resolution of 
3.89 A and with a temperature factor of —79 A (Extended Data Fig. 8d). Focused 
refinement of the upstream DNA assembly (OC2-focused) was achieved by con- 
tinuing the auto-refinement (round 1, class 4; Extended Data Fig. 7c) from the 
first round of local searches using a corresponding soft mask. Focused refinement 
of TFIIE-Tfg2 WH (OC3-focused) was achieved by continuing the OC3 auto- 
refinement from the first round of local searches using a soft mask encompassing 
TFIIE, Tfg2 WH, and the interacting segment of upstream DNA. Focused refine- 
ment of TFIIF dimerization domain (OC4-focused) was achieved by continuing 
the auto-refinement (round 2, class 2; Extended Data Fig. 7d) from the first round 
of local searches using a soft mask encompassing the TFIIF dimerization domain 
and the Pol II lobe. The resolution of focused refinements was determined using 
a soft mask with a 30-pixel soft edge™, to 4.7 A, 7.5 A, and 4.09 A for OC2-, OC3-, 
and OC4-focused refinements respectively (Extended Data Fig. 8b, c, d). 

Single particle cryo-EM analysis of the closed complex was carried out essen- 
tially as for the open complex, with the following differences. Reference-free 2D 
classification of 10,591 particles, picked semi-automatically using e2boxer.py from 
EMAN2”, gave three representative 2D class averages that were low-pass filtered 
to 30 A resolution and used as templates for automated particle picking in RELION 
of all micrographs™. The resultant 155,079 particles were screened manually and 
by reference-free 2D classification, yielding 111,625 particles that were used for 


subsequent processing. The cITC (EMD-2785), low-pass filtered to 50 A, was 
used as the reference model for the initial 10,591 particle set, and the resultant 3D 
reconstruction (14 A resolution), again low-pass filtered to 50 A, was used as initial 
reference model for 3D refinement using the full cryo-EM data set. This revealed 
the CC at 7.5 A, and was improved by particle polishing to 6.5 A resolution using 
RELION 1.4 beta®’. Hierarchical 3D classification without image alignment using 
a soft mask encompassing the complete CC resulted in two populations, CC and 
OCS, respectively (Extended Data Fig. 7e). Both classes were refined using the 
3D auto-refine procedure against the respective particles within that class with a 
soft mask in the shape of the CC. The CC soft mask was also suitable for the OC 
as both complexes have a similar shape. The CC reconstruction was determined 
from 7,527 particles to a resolution of 8.2 A. The OCS5 reconstruction was deter- 
mined from 79,797 particles to a nominal resolution of 6.1 A and a temperature 
factor of —176 A? was applied (Extended Data Fig. 8g). To improve the densities 
TFIIE, Tfg2 WH and downstream DNA in the CC, we carried out focused clas- 
sification of the Pol II stalk and subsequently TFIIE, Tfg2 WH and downstream 
DNA using soft masks encompassing these regions. The individual classes were 
refined as before and yielded a CC at 8.8 A comprising 5,690 particles and a tem- 
perature factor of —300 A? was applied (Extended Data Fig. 8f). A similar focused 
classification scheme for OC5 revealed moderately improved density for TFIIE 
(not shown). 

Local resolution estimates were determined using a sliding window of 40° vox- 

els as previously described except that a single pair two half-maps was used and 
resolution estimates were not capped at the nominal resolution” as no local filters 
were applied. 
Structural modelling. A composite model of the OC was obtained using cryo-EM 
densities OC1-OC4 and the focused refinements of OC2, OC3, and OC4 particles. 
Structural models were built in COOT® unless indicated otherwise. Models were 
refined using the real space refinement routine in Phenix™ into the respective 
OC density, as indicated, with secondary structure and rotamer restraints. First, 
structural models of Pol II (lacking Rpb4—Rpb7), TFIIB B-ribbon (residues 22-59), 
and downstream DNA (PDB: 4V1N!”) were placed into the OC1 map using UCSF 
Chimera®’, followed by rigid-body group refinement in Phenix®. Models of the 
Pol II core and TFUB B-ribbon were adjusted and extended manually, and refined 
in Phenix™ into the OC1 density. The Rpb4—Rpb7 structure (PDB: 4V1N) was 
fitted, and residues at the base of the stalk (Rpb7 1-81, 150-159) were modified 
to fit the density and refined into the OC1 map. TFIIE was modelled using struc- 
tural information on eWH*, E-ribbon*™4, and WH1°%, and a homology model for 
WH (Fig. 3a, b). A homology model of the human TFITEa E-ribbon (PDB: 1VD4 
(ref. 34)) was generated using MODELLER® (Tfal residues 121-158) and 
rigid-body fitted into the unsharpened OC1 density. Upstream DNA (PDB: 
4V1N), TFIIB N-terminal cyclin (PDB: 4BBR, chain M and residues 122-213), 
the homology model of TFIIB C-terminal cyclin domain (PDB: 4V1N, chain M 
and residues 233-343) were fitted into the OC2 cryo-EM density. Models of Pol 
II protrusion, wall and the TFIIB N-terminal cyclin were adjusted and extended 
manually, and the TFIIB C-terminal cyclin (residues 236-328) and Tfg2 linker 
regions (residues 249-280) were built into OC2 and OC2-focused maps and 
refined into the OC2 map in Phenix™. X-ray structures of TBP (PDB: 1YTB®, 
chain A), and TFIIA (PDB: 1YTF”, chains B—D) were individually fitted into the 
OC2 map obtained from focused refinement (OC2-focused). The fit of upstream 
DNA, from the upstream edge of the DNA bubble to TBP, was improved in UCSF 
Chimera®! using the OC3 map. Protein models of Tfal eWH (residues 1-90) and 
Tfa2 WH2 (residues 187-249), generated with the I-TASSER prediction server®’, 
were fit into the OC3 density and adjusted. The TFIIE linker helices were built 
de novo and modified to match the density, and together with Tfal eWH and 
Tfa2 WH2 models were refined into the unsharpened OC3 map in Phenix 
and subjected to the phenix.model_idealization routine to optimize geome- 
try. Published NMR data of the Tfg2 WH domain®* (BMRB accession number 
17916) was used to calculate a low-energy model with the BMRB CS-ROSETTA 
server°’. The Tfg2 WH and an I-TASSER model of Tfal WH1 (residues 123-181) 
were placed into the OC3 density obtained from focused refinement (OC3- 
focused). The Tfg2 WH positioning is also consistent with NMR data for the 
Tfg2 WH-DNA interface®. A part of the Tfg1 arm domain (residues 327-349) 
was built into OC3 and OC4 densities and refined into the OC4 density. A pre- 
vious homology model of the TFIIF dimerization domain'® was used as a start- 
ing template for modelling into OC4 and OC4-focused density maps, and was 
refined into the OC4 density. The crystal structure of the Tfg1 N-terminal pep- 
tide (residues 21-35, see below) was fitted into an OC4 density (round 2 class 2, 
Extended Data Fig. 7d). The individually refined models showed good stereo- 
chemistry and were validated with Molprobity”’, and the FSC of map versus model 
(Extended Data Figs 8 and 9). 

To generate the CC model, the OC model of Pol II, TFIIA, TFIIB, TBP, TFIIE 
E-ribbon and TFIIF dimerization domain was rigid-body fitted into the CC density 
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using an automated global 6D correlation search in Situs’!. The remaining protein 
part was divided into two rigid bodies containing (i) Tfal and Tfa2 WH2, and 
(ii) Tfa2 WH1 and Tfg2 WH, which were independently fitted into the CC den- 
sity using Situs’. To model closed promoter DNA, upstream DNA was extended 
with canonical duplex B-form DNA in COOT® and rigid-body fitted in UCSF 
Chimera”” to reflect the density. 

All figures were generated using UCSF Chimera‘. 

Crystallographic analysis of Pol II-TFIIF complex. The structure of the Tfg1 
N-terminal region (residues 21-35) bound to Pol II was determined by X-ray crys- 
tallographic analysis. 12-subunit Pol II was prepared and crystallized>’, and TFIIF 
or selenomethionine (SeMet)-substituted Tfg1 peptide (residues 19-41, Peptide 
Speciality Laboratories GmBH, Heidelberg) was added to the cryo-protectant as 
described”’. Diffraction data were collected on a PILATUS 6M detector at the 
X06SA beamline (SLS, Villigen, Switzerland) at 100K. Data for the Pol II-Tfg1 
SeMet peptide crystal was collected at a wavelength of 0.97972 A and for the 
Pol II-TFIIF crystal at 0.91889 A. Data were processed with XDS and XSCALE”. 
The structure was phased with the crystal structure of the 12-subunit Pol II (PDB: 
3PO2) lacking nucleic acid and refined using BUSTER”. Model building of the 
TFIIF N-terminal residues in COOT® was guided by a selenium anomalous 
difference Fourier peak (M27). Subsequent refinement in BUSTER used secondary 
structure restraints for the Tfg1 peptide helix. The final structures had an Rfree 
factor of 18.0% and 19.4% for the Pol II-Tfg1 SeMet peptide crystal and the Pol 
II-TFIIF crystal, respectively (Extended Data Fig. 9c), and showed good stereo- 
chemistry’’. In the Pol II-Tfg1 SeMet peptide structure 90% of the residues fall 
in favoured regions of the Ramachandran plot and 3% in disallowed regions. 
For the Pol II-TFUF structure 89% of the residues fall in favoured regions of the 
Ramachandran plot and 3% in disallowed regions. 

Yeast and functional assays. TFIIE interfaces with Pol II and DNA were probed 
by in vivo mutagenesis. The yeast strain used for TFA1 genetic assays!° was pro- 
vided by S. Hahn (Fred Hutchinson Cancer Research Center). Mutations were 
introduced into a plasmid encoding TFA1 (pSH810, ARS CEN LEU2 3 x Flag), 
as indicated in Extended Data Fig. 3g. The tfal(AE-wing) mutant was gener- 
ated by deletion of TFA1 residues 71-84, and their replacement by the residues 
GSG. The TFA1 constructs were transformed into the shuffle strain, and were 
streaked twice on —Ura —Leu plates, and subsequently onto yeast extract- 
peptone-dextrose (YEPD) plates. Yeast were freshly grown in YEPD medium and 
resuspended in water to an OD of 1 at 600 nm, and tenfold dilutions were spotted 
on 5-fluoroorotic acid (5-FOA) and YEPD plates, and incubated at 30°C. 

To further characterize the interfaces between TFIIE and the initiation com- 
plex, recombinant TFIJE mutants were purified according to the same protocol as 
wild-type TFHE (Recombinant proteins, Extended Data Fig. 3g, h). The Tfal (poly- 
Ala E-wing) mutant was generated by replacement of TFA1 residues 71-84 with 
poly-Alanine. 

To assess the interaction of wild-type TFIIE and the TFIJE mutants with the 
CC by protein pulldown, 31g purified Pol II was first biotinylated on the Rpb3 
subunit as described!*. The CC, containing biotinylated Pol II, was subsequently 
prepared as above (‘preparation of initiation complexes’), but without TFIIE. This 
preparation was immobilized on 151L Dynabeads M280 streptavidin resin (Life 
Technologies), equilibrated in buffer Q. 10j1g TFIIE or TFIIE mutant (tenfold 
molar excess over Pol II) was incubated with the immobilized CC or control beads 
for 1h at 4°C. The beads were washed four times and bound proteins were analysed 
by SDS-PAGE (Extended Data Fig. 3h). 

Promoter-dependent in vitro transcription was used to determine the activity 
of TFIIE and the TFIIE mutants. The yeast strain used for TFA1 genetic assays'© 
was transformed with a plasmid containing 3 x Flag-tagged TFA1 (pSH810, ARS 
CEN LEU2 3 x Flag). Transformants were streaked once onto — Ura —Leu plates, 
once onto —Leu plates, twice onto 5-FOA plates, and subsequently onto YEPD 
plates. Nuclear extract from 31 yeast culture was prepared as described’’. The 
nuclear extract was immunodepleted of 3 x Flag—Tfal as described!” with the 
following modifications. Before nuclear extract was incubated with anti-Flag 
M2 agarose beads (Sigma), beads were incubated with 1 mg ml“ bovine serum 
albumin (BSA) protein (Sigma) for 1h at 4°C on a turning wheel followed by 
three wash steps. Immunodepleted nuclear extract was separated from beads by 
Micro Bio-Spin chromatography columns (Biorad). Specificity of the depletion 
was confirmed by western blot carried out as described previously’? for 3 x Flag- 
tagged Tfal (Macs Miltenyi Biotec, 130-101-572), Rpb3 (Neoclone, WP012), 
TFIIB (Abcam, sc-274) and Histone H3 (Abcam, ab21054) (Extended Data 
Fig. 3i). The secondary antibodies anti-rabbit IgG horse-radish peroxidase (HRP; 
GE Healthcare, NA934) and anti-mouse IgG HRP (Abcam, ab5870) were used 
(Extended Data Fig. 3i). Activator- and promoter-dependent in vitro transcription 
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and primer extension were carried out as described previously’. Recombinant 
TFUHE (5 pmol) and TFIIE mutants (5 pmol) were added to the depleted nuclear 
extract as indicated in Fig. 3c. Transcripts were visualized on a denaturing 8% 
polyacrylamide TBE gel with a Typhoon 9500 scanner (GE Healthcare) and 
quantified with ImageQuant (GE Healthcare). For quantification, the relative 
activity of each variant compared to TFIIE was calculated for each replicate. The 
mean intensity and standard deviation of three replicates was calculated from 
their relative activities. A second RNA product from the HIS4 promoter was 
observed, apparently resulting from an alternative upstream transcription start 
site. Although some differences in the relative use of the two TSSs are observed 
in the assay, we refrained from interpreting these. 
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Extended Data Figure 1 | See next page for caption. 
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Extended Data Figure 1 | Modelling of open complex cryo-EM 
densities. a, SDS-PAGE analysis of OC-cMed-Med1 complex after 
size-exclusion chromatography. Protein colours as in Fig. 1. Although core 
Mediator was required for stable association of TFIIE, it largely dissociated 
under cryo-EM conditions as observed previously'’. Some remaining core 
Mediator was flexible and located as described previously’’, but could not 
be included in further high-resolution analysis. b, Cryo-EM micrograph 
of the OC-cMed-—Med1 complex. Scale bar, 50 nm. c, Ten representative 
reference-free 2D class averages of OC-cMed-Med1 reveal flexibility 

of the upstream DNA assembly including TFIE (green arrow) and very 
weak density for core Mediator (orange arrow). Compare Extended Data 
Fig. 7a, c. d, Composite cryo-EM density of the OC shown in front and 
top views’. Colours indicate the cryo-EM densities used for modelling 

of the open complex (OCI, grey; OC2, green; OC2-focused, yellow; OC3, 
salmon; OC3-focused, blue; OC4 purple; OC4 round 2 class 2, light blue). 
Shown are the unsharpened cryo-EM densities. The percentage of particles 
from the full set of 257,259 that was used for the respective reconstruction 
is indicated. e, Composite cryo-EM density of the OC superimposed on 

a ribbon model of the OC, coloured as in Fig. 1. The composite cryo-EM 
density enabled modelling of the initiation factors and DNA. Our 
structure also enabled correction of the revised yeast initiation complex 
model obtained by Murakami et al. from cryo-EM at 6 A resolution”, and 
we note the following differences between the structures, superimposed 
on Rpbl: (1) The TFIIF Tfg2 WH domain is rotated by ~180°, which is 
further inconsistent with nuclear magnetic resonance (NMR) data on 

the TFg2 WH-DNA interface® and fits comparatively worse to protein-— 
protein crosslinking data between the Tfg2 WH and Tfa2 WH1 (ref. 17). 
(2) Domains of TFIIE, except Tfa2 WH1, were placed incorrectly: Tfal 
eWH (rotation and translation into the E-linker density; 17 A distance for 
helix «3 in our CC), Tfal E-ribbon (rotation and translation into E-linker 
density; 35 A distance between the Zn atoms), and Tfa2 WH2 (~180° 
rotation). Further, the Tfa2 E-tether region was incorrectly assigned to 
density belonging to the Tfal eWH. The Tfal E-linker was not modelled. 
(3) The TFIF Tfg1 arm was modelled into an empty space lacking density, 
and the Tfg1 helix «0 was absent. Our models of the TFIIF dimerization 
domain, Tfg2 linker, Tfg1 N terminus, and Tfg1 arm fit into densities from 
a recent study”’, indicating the electron microscopic reconstruction is 
correct, but that the modelling was premature at the available resolution. 

f, Ribbon model of the OC coloured according to how different parts of the 
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OC were modelled into the OC cryo-EM densities (see d). Regions with 
atomic (light blue) and backbone models (orange), and DNA (dark blue) 
are indicated. Views as in d. g, Representative regions of the sharpened 
cryo-EM densities OC1 (3.6 A), OC2 (4.0 A), and OC4 (3.9 A) are shown 
with the underlying refined coordinate model. The OC1 density shows 
clear side-chain features for Rpb1 clamp helices a8 and «9 and Rpb2 833, 
the OC4 density for Tfg1 82 that is part of the dimerization domain, and 
the OC2 density for part of the Tfg2 linker. For OC nomenclature, see 
Extended Data Fig. 7. h, Fit of the TBP crystal structure (PDB: 1YTB)® 

to the OC2 cryo-EM density, shown in a Pol II side view”’. i, Fit of TFIIB 
N- and C-terminal cyclin domains, B-linker and B-reader, and B-ribbon 
elements to OC1 and OC2 cryo-EM densities. The B-linker element 
displays weak density, and the B-reader is not observed. j, Fit of the TFIIA 
crystal structure (PDB: 1YTF)*? to OC2-focused cryo-EM density in a 

Pol II top view” (left). The four-helix bundle undergoes a minor rotation 
towards the 3-barrel, and is apparently flexible (compare Extended Data 
Fig. 5e). Toal (middle) and Toa2 (right) subunit structures are shown. A 
large non-conserved insertion in Toal (A95-209), lacking in recombinant 
TFIIA (Methods), may affect the relative positioning of the four-helix 
bundle to the B-barrel. k, Fit of the TFIIF model to OC cryo-EM densities 
viewed from the top>’. TFIIF dimerization domain and Tfg1 N-terminal 
region, arm, and charged helix elements are superimposed on the OC4 
cryo-EM density. Tfg2 linker and WH domains are superimposed on OC2 
and OC3-focused cryo-EM densities, respectively. Subunit architectures 
for Tfg1 (middle) and Tfg2 (right) subunits are shown, indicating 
disordered regions. Secondary structure elements were labelled according 
to the crystallographic model of the human RAP30-RAP74 heterodimer’”?. 
1, Fit of the TFIIE model to OC cryo-EM densities shown from the front?” 
(left). Models for Tfal eWH, E-linker and E-ribbon are superimposed onto 
OC1 and OC3 densities. Models for Tfa2 WH1 domain, Tfa2 WH2 and 
E-tether were fitted into OC3-focused and OC3 densities. Tfal (middle) 
and Tfa2 (right) subunits are shown, indicating disordered regions. The 
connectivity of the E-tether helices remains uncertain. m, Fit of promoter 
DNA to OC cryo-EM densities is shown ina side view’. A weak density 
for single-stranded template DNA contacts the Pol II fork loop 1, and is 
indicated by a blue arrow. Upstream and downstream DNA models are 
superimposed with OC3 and OC1 densities, respectively. The location of 
the Pol II active site magnesium ion is indicated. 
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Extended Data Figure 2 | Details of TFIIF and the upstream DNA 
assembly. a, View of the open complex from the side°’. Pol II elements 
external 2 (dark green), lobe (yellow), protrusion (orange), Pol II subunit 
Rpb12 (dark blue) and basal factors TBP, TFIIB, and TFIIF are coloured 
as in Fig. 1. The remainder of the open complex is transparent. Green and 
purple boxes indicate the locations of TFIIB C-terminal cyclin and TFIIF 
dimerization domains, respectively. b, Interactions of TFIIB C-terminal 
cyclin domain with Pol II protrusion, Rpb12, Tfg2 linker and DNA. 
Colours as in a. c, Details of TFHF dimerization domain interactions 
with Pol II external 2 and lobe°’. d, Crystallographic analysis of the yeast- 
specific Tfg1 N-terminal region. Weak density for the Tfg1 N-terminal 
region was observed by cryo-EM (OC4 round 2 class 2) at low contour 
level (0.0155) close to Pol II elements external 1 and the hybrid binding 
region” (left). X-ray analysis (right) of the corresponding peptide (Tfg1 
F21-R35) enabled modelling and assignment of residue M27 (indicated 
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with asterisk) owing to the anomalous signal. The F, — F, electron density 
map (grey, contour level 2.50), seleno-methionine anomalous difference 
Fourier (yellow, contour level 5c), and final model in ribbon presentation 
(purple) are shown. The sequence of the synthetic peptide used for 
soaking into Pol II crystals is shown below. The modified methionine 
residue and predicted secondary structure are indicated. e, The Fy — F, 
electron density maps obtained from soaking Pol II crystals with TFIIF 
(purple) and seleno-methionine labelled peptide (grey), respectively, 
show similar density in the same location on Pol II. f, The putative Tfg2 

C terminus contacts TBP. Viewed from the side*’. A tubular cryo-EM 
density from the OC3 map, low-pass filtered to 8 A, emanates from the 
TFIF Tfg2 WH-TFUE Tfa2 WH1 density, and was tentatively assigned 
to the Tfg2 C-terminal region. The putative Tfg2 density reaches the TBP 
subunit, consistent with their suggested interaction’””®, 
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Extended Data Figure 3 | Structure-function analysis of TFIIE and its 
interactions in the open complex. a, The architectural model of TFIE 
contains all regions required for viability in yeast'®. A domain schematic 
(top) indicates the good overlap between modelled (dashed line) and 
functionally essential regions. Essential (grey), partially redundant Tfa2 
WH1 and WH2 domains (blue), and non-essential elements (cyan) are 
indicated on the TFIIE model, shown in previously defined front and 

top views”” of Pol II. b, TFIIE sequence conservation. The sequence 
conservation among Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Drosophila melanogaster, Gallus gallus, and Homo sapiens was mapped 
onto a ribbon representation of the TFIIE model. Highly, strongly, weakly 
and non-conserved residues are coloured in green, yellow, white, and 
grey, respectively. The location of a non-modelled helical density in the 
OC3 cryo-EM map, which may correspond to Tfal helix «7, is indicated. 
Views as in a. c, An additional density (green) in the OC3 cryo-EM map 
on top of the Tfal E-wing was tentatively assigned to Tfal helix «7 and this 
may stabilize the long 8-hairpin. A front view is shown”. d, Tfal-FeBABE 
cleavage sites in TFIIE”® are consistent with the TFIIE architecture. 

e, Tfal- and Tfa2—FeBABE cleavage sites in the Pol II clamp!° and a 
protein-protein crosslink between Rpb1 K212 (Pol II clamp)-Tfa2 K277 
(TFIIE E-tether)!” are consistent with the location of eWH and E-tether. 

f, Tfa2-Tfg2 protein-protein crosslinks’” are consistent with the Tfg2 
WH-Tfa2 WH1 architecture. g, The TFIIE mutations used for functional 
characterization were mapped onto a domain schematic and the model 

of TFIIE, shown in a front view’. h, Pulldown assays with recombinant 
TFIIE variants carrying mutations at the TFI[E—-CC interface revealed 
that the E-ribbon is essential for TFIIE recruitment. For details of the 


TFIE mutants, see g. Pulldowns were analysed by SDS-PAGE (Coomassie 
staining). To confirm the integrity of the purified TFIIE variants, 2 jug were 
analysed (left). Some minor contaminant and degradation bands of TFIHE 
are indicated by an asterisk. The bead elution from the pulldown assay 

is shown (middle), providing negative (no TFIIE) and positive (TFIIE) 
controls in the two leftmost lanes. The binding of all TFIIE variants to the 
CC was impaired compared to the wild-type protein, with the exception 
of the Tfal(AE-wing) mutant, suggesting that all other interfaces 
contribute to TFIIE binding affinity. The most severe binding defect was 
observed upon mutation of three residues in the E-ribbon (Tfa1(L134/ 
V137/L140)) to glutamate or alanine. This suggests that the E-ribbon is 
largely responsible for recruitment of TFIIE to the CC. The bead-only 
control (right) indicated that TFIIE and TFIHE variants did not show 
unspecific binding to the beads. i, Western blot analysis of the 3 x Flag- 
tagged Tfal confirms specific immune-depletion of Tfal in the nuclear 
extract (NE), whereas levels of Pol II (Rpb3), TFIIB, and Histone H3 were 
unaffected. j, Yeast complementation assays were performed in triplicate 
experiments with wild-type TFA1, an empty vector, and TFA1 variants 
with mutations in the TFITE eWH domain (N50E/K51E/T52E, N50A/ 
K51A/T52A, and P56A/A59E/R62E in eWH helix a3, and AE-wing), or 
the E-ribbon(L134E/V137E/L140E) (see Methods). k, The long E-wing in 
the TFIIE subunit Tfal eWH is characteristic of WH domains involved in 
DNA strand separation’’. The upstream edge of the transcription bubble 
and eWH domain are shown in a front view” rotated by ~20° in the 
horizontal axis. Corresponding regions of human (Hs) Werner syndrome 
ATP-dependent helicase (WRN) WH (PDB: 2WWY) and RecQ1 WH 
(PDB: 3AAF) domains are shown. 
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Extended Data Figure 4 | Closed complex and spontaneously formed 
open complex. a, SDS-PAGE analysis of CC-cMed-Med1 complex after 
size-exclusion chromatography. Protein colours as in Fig. 1. b, Cryo-EM 
micrograph of the CC-cMed-Med1 complex. Scale bar, 50 nm. c, Ten 
representative reference-free 2D class averages of CC-cMed-Med 1 reveal 
flexibility for the upstream complex. Core Mediator was not retained 
during cryo-EM analysis. d. Detailed view of the Pol II funnel helices in 
the CC (top) and OC5 (bottom) densities. e, Promoter sequences and 
differences in protein-DNA interactions are shown for the two distinct 
nucleic acid scaffolds used for preparation of closed and open complexes 
(compare Fig. 1d). Coloured bars indicate DNA-protein interaction. Solid, 
shaded, and empty circles respectively represent nucleotides included 

in the structure, excluded owing to weak cryo-EM density, or excluded 
owing to absence of cryo-EM density. Analogous yeast (black) and human 
(grey) numbering of promoter DNA is shown. The TATA-box sequence 
(red box) and HIS4-promoter sequence absent in the modified OC nucleic 
acid scaffold’? (grey box) are indicated. Protein-DNA interactions in the 
region covered by the light grey box are unchanged between CC and OC, 


and shown only for the OC for clarity. Unique and altered interactions 

are shown for each complex. DNA-TFHEa photo-crosslinks, indicated 

by black asterisks, were observed in a closed but not open promoter 

state*? and are consistent with the CC model. f, Fit of TFIIE, Tfg2 WH 

and downstream DNA into CC density. Two rigid bodies were used for 
fitting: (i) Tfg2 WH and Tfa2 WH1 and (ii) Tfa2 WH2, eWH, E-linker 

and E-tether helices. Although the overall fit reflects density well, the 
eWH domain and its E-wing may be rotated further away from promoter 
DNA. g, Details on the location of downstream DNA (template, blue; non- 
template, light blue), Tfg2 WH, and Tfa2 WH1 and WH2 in the closed 
(dark colours) and open (light colours) complexes in the same view as in f. 
h, Cryo-EM density of OC5 and the OC ribbon model are shown in a front 
view’’. The OC5 map shows weak density in regions of upstream assembly, 
TFIE, and DNA that may be caused by increased flexibility owing to the 
heterogeneous population of spontaneously opened DNA. Colours as in 
Fig. 1. i, Fit of promoter DNA to cryo-EM densities of CC and OC5, shown 
in a side view”. 
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Extended Data Figure 5 | Pol II cleft clearance, structural flexibility 
and rearrangements in the OC. a, Pol II lid and fork loop 1 assume new 
conformations in the OC, clearing the Pol II cleft for loading of single- 
stranded template DNA. Arrows indicate the direction of movement 

of the two Pol II elements, and the template DNA loading path. The lid 
(dark red) in the open complex is moved in comparison to the lid of a Pol 
II-TFIIB ITC crystallographic study (PDB: 4BBS). Yellow and red boxes 
indicate zoomed-in regions of b and c, respectively. b, The movement in 
the Pol II lid leads to a steric clash with the TFIIB B-reader, observed in 
a Pol II-TFIIB ITC crystal (PDB: 4BBS), and facilitates its withdrawal in 
the open complex. In particular the lid residue F252 clashes with W63 
and S67 of the B-reader. The OC1 cryo-EM density is shown for both 
lid and B-reader elements. c, The cryo-EM density of the OC1 reveals 
an ‘open Pol II fork loop 1 and a stably associated fragment of putative 
template DNA. The ‘open’ state of fork loop 1 provides additional space 
for loading of single-stranded template DNA past the Pol II rudder, 
towards the active site cleft. d, The position of the TFHB N-terminal 
cyclin domain (light green) is altered in comparison to a Pol W-TFIB 
ITC crystal structure’? (dark grey), but similar to its location in a cITC” 
(light grey), probably owing to the presence of DNA. e, Flexibility of the 
upstream DNA assembly. The cryo-EM data of the OC was sorted on 
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the basis of structural differences using an upstream assembly mask that 
included upstream DNA, TFIIA, TBP, and TFIIB cyclin domains (OC2 
round 1, compare Extended Data Fig. 7c). Four of five resultant classes 
revealed different positions of the upstream complex, indicated here by 
fitted ribbon models of the OC. Previously defined front and side views” 
are shown. Class 2 (middle) revealed the TFIJA four-helix bundle rotated 
by 85°, consistent with a high degree of flexibility. Class 4 represents the 
largest fraction of the data (31%), and gave a more defined density for 

the upstream complex, which was improved by further classification 
(Extended Data Fig. 7c). Class 5 presented with no density for the 
upstream complex or the Tfg2 linker, but did show density for the TFIIB 
B-ribbon and the TFIIF dimerization domain, suggesting that TFIIB and 
TFIIF remained bound to the complex. This is consistent with TFIIF- 
dependent association of the TFIIB-core domain with the Pol II wall?’, and 
this apparently requires an ordered Tfg2 linker. f, The Rpb4—Rpb7 stalk 
adopts different positions in cITC, cITC-cMed, and OC. This suggests 

that Mediator and TFIE may bind co-operatively. This is consistent with 
previous findings’* and with pulldowns (Extended Data Fig. 3h), which 
suggest that the TFIIE E-ribbon-stalk interface, which is important for 
TFUE recruitment, is stabilized in the presence of Mediator. 


0 
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Extended Data Figure 6 | Pol II clamp positions and TFIIB B-reader 
mobility during DNA opening. a, The yeast CC is shown from a side 
view’, indicating the path of DNA and location of TFIIE. The eye symbol 
(grey) indicates the point of view in b. b, The Pol II clamp may undergo 
transitions during DNA opening as indicated. The OC model of the Pol II 
clamp is shown superimposed on yeast CC (this study), and yeast OC 
(this study). The OC model Pol II clamp was rigid-body fitted to the 
human CC cryo-EM density”? (EMD-2306) and is superimposed. The 
view is from the front’. c, The TFIIB B-reader element shows strong 
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density only in the ITC state, suggesting that its mobility in earlier states 
may be important for maintaining a cleared path for template DNA 
loading into the Pol II cleft. Ordering of the B-reader may further lead 

to stabilization of the upstream promoter assembly that is flexible in the 
OC (Extended Data Figs 5e, 7c). Cryo-EM densities for yeast CC (this 
work), OC5 (this work), OC (this work), and ITC (EMD-2785) complexes 
are superimposed on the TFIIB model (PDB: 4BBS for the B-linker and 
B-reader). As secondary structure elements could not be resolved in the 
human CC”, we excluded this cryo-EM density from comparison. 
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Extended Data Figure 7 | See next page for caption. 
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Extended Data Figure 7 | Three-dimensional classification of cryo-EM 
data. a, Three-dimensional image classification of the cryo-EM data 

set into eight classes using an initial OC reconstruction as the reference 
model, revealed heterogeneity. The percentage of single particles 
contributing to each class is provided. To help visualize structural 
differences, 3D reconstructions of the OC are coloured according to 
mobile regions: Pol II core, TFIIB B-ribbon (grey); upstream DNA, TFIIA, 
TBP, TFIIB cyclin domains, Tfg2 linker (green); TFIIF dimerization 
domain (purple); TFIIE except E-ribbon, Tfg2 WH (magenta); Pol II 
Rpb4-Rpb7 stalk and E-ribbon (blue); cMed-Med1 (yellow). b, Focused 
classification into five classes using a mask covering the Pol IT stalk and 
E-ribbon. The resultant class 1 (OC1) was subsequently refined to 3.58 A 
resolution (grey box) and revealed the location of the TFIIE E-ribbon. 
Colours as in a. c, Improvement of densities for Tfg2 linker, TFIIB, and 
TFIIE, through rounds of focused 3D classification using various masks. 
First, heterogeneity due to flexibility of upstream DNA and associated 
factors was overcome by applying a mask around this region (round 1). 
Focused refinement of the upstream DNA assembly of the resultant class 
4 of round 1 (OC2-focused), improved the density quality for TFIIA 
(Extended Data Fig. 1j). Classification of the OC2-focused density 
revealed the upstream DNA complex (OC2) at 3.97 A resolution (green 
box). Separate classification of class 4 of round 1 using OC, Pol II stalk 
and TFIIE E-ribbon, and TFIIE masks yielded class 1 of round 4 (OC3, 
magenta box) that contained a complete TFIIE density at a nominal 
resolution of 4.35 A after 3D refinement (see Extended Data Fig. 8c). The 
small fraction of stably bound TFHE is consistent with its reduced affinity 


to the pre-initiation complex”’. Focused refinement of OC3 with a TFIIE- 
stalk mask (OC3-focused) improved density for Tfg2 WH and Tfa2 WH1 
domains. Colours as in a. d, To improve the density of TFIIF dimerization 
domain and the Tfg1 arm, three rounds of classification using a TFIIF, 
TFIIF dimerization domain, and OC mask were employed. Class 2 of 
round 2 (cyan box) enabled fitting of the Tfg1 N-terminal peptide, which 
was resolved by X-ray analysis (Extended Data Fig. 2d, e). This class was 
further refined locally using a mask covering the TFHF dimerization 
domain, and then classified with an OC mask, revealing class 6 of round 3 
at 3.89 A resolution after 3D refinement (purple box). Colours as in a. 

e, 3D classification of the CC cryo-EM data set into four classes, using an 
initial CC reconstruction as the reference model, revealed heterogeneity. 
Mobile regions in the reconstructions are highlighted: promoter DNA 
(blue), TFIIE (except E-ribbon), and Tfg2 WH (orange). Classifying the 
most populated classes from round 1 into three classes unexpectedly 
revealed open and closed promoter DNA states in the data set: CC (round 2, 
class 1) and OCS (round 2, class 3). Class 3 of round 2 (OCS) was refined 
to 6.1 A resolution (blue box). Class 1 from round 2 was further classified 
into three classes. The resultant class 3 of round 3 revealed density for 
closed downstream promoter DNA above the Pol II cleft, and TFIIE. 

The cryo-EM density for downstream DNA and TFIIE was improved by 
focused classification using two soft-edged masks. A mask covering the 
Pol II Rpb4-Rpb7 stalk yielded a class with better occupancy for the stalk 
(round 4, class 3), which was further sorted using a mask covering TFHE 
and Tfg2 WH to improve their densities. Class 1 of round 5 was refined to 
8.8 A resolution (CC, orange box). 
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Extended Data Figure 8 | Resolution of cryo-EM reconstructions. density obtained from focused refinement (OC2-focused) with a soft mask 
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reconstruction (FSC = 0.143). Orientation distribution plot of all particles | The region masked for focused refinement is indicated with a grey outline 
that contribute to the OC1 reconstruction (middle). The OC1 cryo-EM on the cyro-EM map coloured by local resolution (right). c, As in b, but 
map is shown (right) from a front view and a central slice through the for the OC3 and OC3 focus-refined reconstructions. d, As in a, but for the 
reconstruction, which are coloured by local resolution as described’. OC4 and OC4 focus-refined reconstructions. e, As in a, but for the CC 
b, As in a but for the OC2 reconstruction. The gold-standard FSC for the reconstruction. f, As in a, but for the OC5 reconstruction. 
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a = Cryo-EM data collection and refinement statistics of the OC structure 
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C X-ray crystallographic data collection and refinement statistics 
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Model composition Reon Reece (%) 15.4/18.0 16.2/19.4 
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Ramachandran plot 
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Allowed 5.4% 6.8% 8.4% 
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*A model of Upstream assembly components was refined into the OC2 cryo-EM density: Pol II protrusion (residues 
44-218 and 405-465), Tfg2 Linker (residues 249-280), TFIIB N-and C-terminal cyclin domains (residues 124-326), 
and the Tfg1 arm (residues 327-335). 
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OC2, OC4) and overall model and cryo-EM map (OC3, CC) 
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Extended Data Figure 9 | Data collection, refinement statistics, and 


structure validation. a, Cryo-EM data collection and refinement statistics 
of the OC structure. Different regions of the composite OC structure were 


refined into OC1, OC2, and OC4 maps as described (see Methods) to 


obtain an atomic model for 90% of the structure. b, Gold-standard FSC 


0.30 0.35 0.37 


between the respective coordinate models and local regions of the OC1, 


OC2, and OC4 cryo-EM maps used for model refinement and between 
overall OC and CC models compared to OC3 (best TFHE density) and CC 


cryo-EM maps. c, X-ray crystallographic data collection and refinement 


statistics. 
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Near-atomic resolution visualization of 
human transcription promoter opening 


Yuan He!?, Chunli Yan°, Jie Fang*, Carla Inouye”, Robert Tjian**°, Ivaylo Ivanov? & Eva Nogales!*°® 


In eukaryotic transcription initiation, a large multi-subunit pre-initiation complex (PIC) that assembles at the core 
promoter is required for the opening of the duplex DNA and identification of the start site for transcription by RNA 
polymerase II. Here we use cryo-electron microscropy (cryo-EM) to determine near-atomic resolution structures of 
the human PIC in a closed state (engaged with duplex DNA), an open state (engaged with a transcription bubble), and 
an initially transcribing complex (containing six base pairs of DNA-RNA hybrid). Our studies provide structures for 
previously uncharacterized components of the PIC, such as TFIIE and TFIIH, and segments of TFIIA, TFIIB and TFIIF. 
Comparison of the different structures reveals the sequential conformational changes that accompany the transition 
from each state to the next throughout the transcription initiation process. This analysis illustrates the key role of TFIIB 
in transcription bubble stabilization and provides strong structural support for a translocase activity of XPB. 


Eukaryotic gene transcription is tightly controlled during the initia- 
tion stage, when RNA polymerase II (Pol IT) and the general transcrip- 
tion factors (GTFs) (TFUD, TFIIA, TFIIB, TFITE, TFIIF and TFIIH) 
assemble at the promoter into a PIC! °. Recruitment of TFIID, TFIIA 
and TFIIB to the DNA facilitates correct loading of the Pol II-TFIIF 
complex, followed by engagement of TFIIE and TFIIH*. 

The initial closed promoter complex (CC) must transition into an 
open complex (OC), where the melted single-stranded template DNA is 
inserted into the active site and Pol II locates the transcription start site 
(TSS). This transient OC is then converted into an initial transcribing 
complex (ITC), where the first phosphodiester bond forms and mes- 
senger RNA starts to be synthesized®®. Following abortive cycles of 
synthesis of short RNA products’, Pol II eventually clears the promoter 
and a stable elongation complex (EC) forms. 

Crystallographic structures and crosslinking data have led to models 
of partial PIC assemblies for Saccharomyces cerevisiae*'°. Cryo-EM 
studies of human’! and yeast'*’ PIC complexes have described larger 
assemblies. Recently, we combined the cryo-EM structures of human 
TFIID-TFIIA-DNA and a TBP-based PIC to generate a model of 
the full TFIID-based structure that describes how TFHD recognizes 
the core promoter and contributes to PIC recruitment'*. However, 
the atomic resolution description of this complex machinery and 
its transitions between the CC, OC, ITC and EC states are yet to be 
determined. 

Here we describe cryo-EM structures at 3.9-8.6 A resolution lead- 
ing to near-atomic models of the human TBP-based PIC throughout 
transcription initiation. Comparisons between states provide new 
mechanistic insights into the processes of DNA engagement, promoter 
melting and transcription bubble stabilization. 


Visualization of human initiation complexes 

We used our previously developed system for assembly and purifica- 
tion of DNA-bound human pre-initiation complexes, in which we used 
TBP in place of TFIID". We included TFIIS because of its potential 
role in PIC formation, in addition to its role in elongation’. Given 
that TFIIS can stimulate the weak RNA nuclease activity of Pol II'®'”, 


we also analysed the ITC structure without TFHS (denoted hereafter 
as ITC(—IIS)). In order to mimic the CC, OC and ITC states, we used 
three different nucleic acid scaffolds (Fig. 1a). This strategy allowed us 
to visualize the changes in the PIC, from its assembly on duplex DNA, 
to one preceding clearance of the promoter. Our purified PICs were 
functionally active (Extended Data Fig. 1). The overall architecture of 
the four states analysed (Fig. 1b) resembles our reconstructions of CC 
published previously'?". 

Local resolution estimation and sorting procedures (Extended data 
Figs 2, 3) showed that the TFIIH attachment to the rest of the PIC 
has some flexibility, thus limiting the resolution of the full PIC owing 
to slight misalignment. To overcome this limitation we also obtained 
reconstructions of the ‘PIC core; in the absence of TFIIH (Fig. 1c). 
The density maps for the full PIC and the core PIC were reconstructed 
using the same data set for each state, generated by automatically pick- 
ing particles from images of samples that included all GTFs, but for 
which some of the complexes were incomplete. A different strategy 
was used for the core CC (see Extended Data Fig. 2 versus Extended 
Data Fig. 4), because for the CC sample, TFIIE was often not present 
in complexes lacking TFIIH (in agreement with a cooperative mode 
of incorporation for TFIIE and TFIIH into the PIC"’). Instead, a local 
focused refinement of the CC core was conducted for the particles used 
to reconstruct the full CC (Extended data Fig. 2). 

The global resolution for the core OC, ITC and ITC(—IIS) states is 
~3.9A, and for the CC core is 5.4. A. Peripheral GTFs are more flexible 
and ata slightly lower resolution, ranging from 4.5 A to 7.5 A. However, 
Pol II and most of TFIIB were determined beyond 4.5 A resolution, 
approaching 3 A in certain regions (Fig. 2, Extended Data Fig. 4 and 
Supplementary Videos 1, 2). For the rest of the complex, secondary 
structure was clearly visible throughout, allowing homology modelling 
and flexible fitting to eventually produce near-atomic models for the 
PIC at each step (Fig. 1d and Extended Data Fig. 5). The nucleic acids 
are clearly visualized, and separation between bases is apparent close 
to the active site (Fig. 2b and Supplementary Video 2). Interestingly, 
there is no density for RNA in the presence of TFIIS (ITC), whereas 
the RNA is clearly visible in the absence of TFHS (ITC(—HS)) 
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Figure 1 | Cryo-EM reconstructions of human PIC in different states 
during the initiation process. a, Nucleic acid scaffolds used. Filled and 
open circles correspond, respectively, to the core promoter and poly-T 
mismatch sequences. For the OC, two additional bases upstream are 


(Fig. 2b). This result indicates that TFUS may have changed RNA affin- 
ity for the template DNA, caused backtracking of Pol II, or activated 
Pol II nuclease activity. 
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Figure 2 | Examples of near-atomic resolution regions. a, EM densities 
and corresponding atomic models for the double-1 8-barrel domain 
composing the conserved core of Pol II within RPB1 and RPB2 (ref. 49) 
(see also Supplementary Video 1). b, Structural details around the Pol 

II active site. Densities are shown at two different thresholds. The lower 
threshold (mesh) allows visualization of the more flexible elements. 
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opened in the cryo-EM structure. Black circles correspond to RNA. 
b-d, Cryo-EM reconstructions of holo-PICs (b), core PICs (c), and MDFF 
models (d), for the CC, OC, ITC and ITC(—IIS) states. 


Revised structural models for core PIC components 

We traced the entire length of TFIIB in the OC, ITC and ITC(—IIS) 
reconstructions, and most of it in the CC structure (Fig. 3a and 
Extended Data Fig. 5a). Instead of the TFIIB linker helix described in 
crystal structures of yeast Pol II-TFIIB®'’, we observed an extended 
loop directly contacting the single-stranded non-template DNA. This 
segment is the least conserved region in TFIIB and may represent a 
functional departure between yeast and humans. Alternatively, the 
structure of this region of TFIIB within our PIC complexes could be 
a result of the interaction with its natural ligand, an initiation DNA 
bubble, which was absent from the crystallographic studies. In fact, 
the entire path of the B-linker is slightly different between the human 
and yeast structures (Fig. 3a). 

Our structures revealed an extended interface between the 
N-terminal lobe of TBP and residues 307-332 in TFIIAB (Fig. 3b, 
Extended Data Fig. 5b). This interaction is in agreement with mutagen- 
esis studies!®°, and was partially captured in a structure of yeast 
TBP-TFIIA-TATA-box”!. This conserved surface on TBP is a site for 
association with transcription activators or repressors”””’, and has been 
shown to be critical for releasing the inhibitory effect of TAF1 within 
TFIID, possibly through binding to TFIIA™. 

The RPB1 trigger loop has been observed in crystallographic struc- 
tures of the yeast Pol II in either a closed ‘on’ state, engaged with the 
last-added nucleotide”, or an open ‘off’ state”®. In all of the initiation 
states described here, the trigger loop adopts an ‘off’ state (Fig. 3c, 
Extended Data Fig. 5c). 

Consistent with previous models of TFIIF within the PIC!}?””8, 
the dimerization domain of its RAP30 and RAP74 subunits is located 
near the RPB2 lobe and RPB9 jaw of Pol II, whereas the C-terminal 
winged-helix (WH) domain in RAP30 directly contacts the BREd 
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Figure 3 | Newly visualized structural elements of the human PIC. 
Segmented cryo-EM densities from the ITC(—HsS) reconstruction 

shown at two different thresholds (the lower threshold (mesh) facilitates 
visualization of the more flexible elements). a, Differences in the path of 
the B-linker of TFIIB between the yeast crystallographic model!° (green) 
and the human ITC cryo-EM structure (coloured as in the rest of the 
figures). Superimposition was performed by aligning the rigid part of Pol 
II (that is, excluding the clamp and the stalk), here and for all other 
figures. b, Residues 307-332 of TFIIAG interact with the N-terminal lobe 
of TBP. c, Trigger loop in RPB1 in an open off’ state. The trigger loops in 
crystal structures of yeast EC in its closed ‘on’ state (2NVZ”», light blue), 
and in its open ‘off’ state (1Y1V°, green) are shown for comparison. 

d, RAP30 WH domain and linker within TFIIF. Elements of the PIC 

in close proximity are indicated. e, Key domains of TFIIE and their 
interaction partners within the PIC. f, Cryo-EM structure of TFHH (the 
flexible CAK subcomplex is not visible after averaging) obtained from a 
combined data set of open promoter states. Segmentation based on fitting 
of available structures and homology models. 


(downstream TFIIB recognition element) DNA, directly above the 
RPB2 protrusion (Fig. 3d, and Extended Data Fig. 5e). We modelled 
the RAP74 al helix, an additional «2 helix, and the linker between al 
and the dimerization domain, following a continuous region of density 
in our maps. Residues 119-175 in RAP30, which constitute the linker 
between the dimerization and WH domains, were modelled de novo 
for the four states. This linker makes extensive contacts with the RPB2 
external 2 and protrusion of Pol II, as well as with TFIIB, TBP, and the 
BREd, implying that it plays a critical role in the correct positioning of 
the RAP30 WH domain for stabilization of the promoter DNA. This 
proposal is consistent with studies in yeast showing that any truncation 
from the WH end of the linker is lethal for growth?*. We could not 
localize the RAP74 arm domain with certainty in any of the states, 
indicating that this part of TFIIF is probably flexible. 

Our density maps allowed us to model the major domains in TFIE: 
the WH, zinc ribbon (ZR) and helix-turn-helix (HTH) domains of 
TFIIEa, and the WH1 and WH2 domains of TFIIES (Fig. 3e and 
Extended Data Fig. 5d). The relative positioning of the three WH 
domains in TFIE follows the previous beads-on-a-string model based 
on XL-MS (protein crosslinking coupled with mass spectroscopy) 
analysis’, but the orientation of the domains had to be revised. The first 
three helices in the WH domain of TFIIEq are involved in dimeriza- 
tion with the WH2 domain of TFIIE8, whereas the 4th helix directly 
interacts with the loop at the tip of the RPB1 clamp coiled-coil in the 
three open promoter states. Further down, the first helix in the WH1 
domain of TFIIE@ is involved in the interaction with the WH domain 
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of RAP30. The ZR domain of TFIIEqa associates with the deep groove 
in the RPB7 oligonucleotide/oligosaccharide-binding fold (OB fold) in 
the stalk. Two helices extending from both sides of the ZR were mod- 
elled following helix-like densities directly associated with the WH in 
TFIIEa. Additional densities near the HTH domain of TFIIEa might 
correspond to the C-terminal extension of TFIIEG following the sec- 
ond WH, contributing to an extended dimerization interface between 
the two TFIE subunits. This proposal agrees with crosslinking studies 
indicating that the C terminus of TFIIE( is close to the region flank- 
ing the ZR within TFIIEa, in addition to the WH®”’. This C-terminal 
extension has been shown to be particularly important for viability’, 
in agreement with a critical contribution to the dimerization between 
the two subunits of TFIIE. 

Despite previous efforts””~~, we still lack a detailed subunit archi- 
tecture for TFIIH. By combining the three open promoter data 
sets (OC, ITC and ITC(—IIS)) and following a focused refinement 
procedure (Extended Data Fig. 6), we produced an improved cryo-EM 
reconstruction of DNA-bound, ‘core’ TFITH (not including the mobile 
kinase module") that allowed the clear visualization of the major and 
minor grooves of the TFIH-engaged DNA, and the modelling of six 
out of the seven core subunits (Fig. 3f). Consistent with our previous 
model'!, XPD is near the Pol II stalk and TFIIE, whereas XPB is located 
at the other end of the horseshoe-shaped TFIIH density, engaging the 
downstream DNA. When comparing our XPB model and the structure 
of the Sulfolobus solfataricus SW12/SNF2 ATPase core bound to DNA*’, 
the N-terminal domain of both enzymes interacts with the duplex DNA 
in a very similar manner (Extended Data Fig. 6b, c). However, the 
C-terminal domains are rotated with respect to each other (Extended 
Data Fig. 6d). The rotation resembles the conformational change pro- 
posed for the SsoRad54 ATPase* on the basis of a FRET (fluorescence 
resonance energy transfer) study and the crystallographic structure of 
the protein in the absence of DNA”. In our model, the DNA-bound 
XPB is in a completely different conformation from that captured for 
SsoRad54, emphasizing the extensive conformational plasticity of these 
enzymes, probably related to the ATP-driven changes that result in 
DNA translocation. 

The crystal structure of P8 and the C-terminal dimerization domain 
of P52 fits well into our reconstruction, forming a heterotrimer with 
XPB on one side of the complex (Fig. 3f). We propose that the P52 
Hub-A domain (hA-D)°? interacts with the N-terminal, TFB2C-like 
domain of XPB (not built into the density), whereas P8 interacts with 
the XPB C-terminal domain. This model is consistent with the stim- 
ulation of XPB ATPase activity by both P52 and P8 (refs 36, 37). The 
N-terminal, HEAT repeat domain in P52 serves as a bridge between 
the two ends of TFIIH. It directly interacts with P34, which in turn 
contacts the P44—XPD subcomplex on the other side of TFIIH. Because 
of the lower resolution for this region, possibly owing to flexibility, the 
orientations of the P34 von Willebrand A (VWA) domain and the P52 
HEAT repeat are tentative. There is still unexplained density that proba- 
bly corresponds to P62 and the C-terminal ring finger domains of both 
P34 and P44. We propose that the P62 PH domain may correspond to 
a flexible density observed near TFIIE for our four reconstructions of 
the full PIC (Extended Data Fig. 6e). 


30-32 


Comparison of CC structure with previous yeast models 

The crystallographic structures of yeast Pol II-TFIIB®’*, TFIIB-TBP- 
TATA DNA®*’, and TFIIA-TBP-TATA DNA”! can be combined to 
generate a model of the Pol II-TFIIB-TFIIA-TBP and DNA complex. 
In this ‘synthetic’ structure, the TBP-TFHA-TFIIB subcomplex and 
Pol II clamp and stalk regions need to be rotated to fit the human CC 
cryo-EM map (Extended Data Fig. 7a—c). These three modules appear 
inter-connected: movement of TBP-TFIIA-TFIIB-DNA closer to Pol 
II would cause a clash of the DNA with the clamp, thereby pushing 
the clamp open; opening of the clamp results in tilting of the stalk 
and engagement of the DNA duplex. As the axis for the TBP-TFIIA- 
TFIB-DNA module rotation is by the N-terminal cyclin fold of TFIIB 
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(Extended Data Fig. 7a), we speculate that TFIIB in the crystal structure 
with Pol II may be in a DNA pre-engagement state. We also observe 
differences between our structure and a previously published yeast 
cryo-EM CC model’? (Extended Data Fig. 7d), which itself differs from 
the synthetic model based on crystal structures. The TBP-TFIIA-TFIB 
sub-complex in the yeast CC cryo-EM study is rotated away from Pol 
II, which does not contact the DNA. Furthermore, the clamp is closed 
down, as we see for our OC and ITC complexes (see below) or that 
observed for EC structures”®*’. These discrepancies may reflect known 
differences between the yeast and human systems. Whereas the TSS 
is found ~30 base pairs (bp) from the TATA box in metazoans, this 
distance can be as long as 120 bp in yeast**!. We speculate that the 
yeast arrangement would allow more single-stranded DNA to be 
accommodated by the PIC during scanning for the TSS. 


Remodelling of the PIC during promoter opening 

The most notable change between the core CC and OC structures, in 
addition to the DNA itself (see below), is the movement of the Pol II 
clamp and of TFIIE, which directly contacts the clamp (Fig. 4a). The 
TBP-TFIIA-TFIIB-TFIIF sub-complex, defining a structural unit that 
surrounds the upstream core promoter elements, moves slightly, maybe 
as a relaxation process following the release of tension after the down- 
stream bent DNA is melted (Fig. 4a). The bridge helix does not change 
during bubble opening (Fig. 2b, Fig. 4b). However, the non-template 
DNA near the downstream bubble junction clearly tilted the Pol II fork 
loop 2 in the OC, which might therefore function as a sensor of the 
open promoter (Fig. 2b, Fig. 4b, c). This loop stays tilted in both the 
ITC and ITC(—IIS) states, further supporting our hypothesis (Fig. 2b). 
A very similar conformation has been recently reported for the bacterial 
ITC* (Extended Data Fig. 8), suggesting the functional conservation 
of this critical element at the initiation stage of transcription across 
different kingdoms. 

The most significant changes during promoter opening concern the 
Pol II clamp and TFIIE (Fig. 4d and Extended Data Fig. 9b). In the CC, 
the clamp coiled-coil domain associates with the C-terminal extension 
of TFITE3. However, in the OC, as the coiled-coil moves down when 
the clamp closes, the connection with TFIIE@ is broken and another 
interface is established with the 4th helix of TFIIEa, shifting the entire 
WH by ~5A. This change probably makes the interaction with TFIIE 
more energetically favourable, explaining the increase in affinity of 
TFIIE observed in the OC and ITC compared to the CC. 

In the CC, Pol II interacts directly with the promoter DNA duplex via 
the 3-strand 6-sheet of the clamp head (Fig. 4e, Extended Data Fig. 9c) 
and the 2-helix bundle in RPB5, (Fig. 4f, Extended Data Fig. 9d), imme- 
diately upstream and downstream of the initiator (INR) element (and 
thus the TSS). In the OC, the clamp head and RPBS still engage the 
DNA, but in a slightly different manner (Fig. 4e, fand Extended Data 
Figs 9c, d). 

For the elements described, there are no significant changes between 
the OC and the ITC and ITC(—IIS) states (Extended Data Figs 9a—d). 
Notably, the CC reconstruction does not show any detectable density 
for the TFIIB linker, indicating that it is disordered before promoter 
opening (Extended Data Figs 5a, 9a). However, in the OC, ITC and 
ITC(—IUS) structures, the B-linker becomes stabilized by direct inter- 
action with the clamp coiled-coil and rudder on one side, and the 
non-template DNA strand on the other (Fig. 4c, e and Extended Data 
Figs 5a, 9a). This intricate protein-DNA network likely plays a key role 
in stabilizing this part of the initiation bubble, which is fairly solvent 
accessible. 


DNA transition through initiation into elongation 

The full promoter DNA is clearly visible in all our reconstructions, 
making it possible to compare its path throughout these transcription 
stages (Fig. 5). The upstream DNA, which includes BREu, TATA, and 
BREd elements, is tightly engaged during all stages of transcription 
initiation by the TBP-TFIIA-TFUB-TFIIF sub-complex (Fig. 1c, d). 
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Figure 4 | Structural transitions during promoter opening. a, Colour- 
coded motion of backbone atoms between CC and OC based on r.m.s.d. 
b, Change of fork loop 2 near the active site between the CC (green) 

and OC (coloured) (see also Fig. 2b). A similar colour scheme is used 

for the superposition of CC and OC states in the third column of d-f. 

c, Segmented cryo-EM density from the OC reconstruction and the 
corresponding model for TFIIB and various Pol II loops critical for 
stabilization of the transcription bubble. The density for the nucleic acids 
has been omitted for clarity. d, Changes in the contact between the RPB1 
coiled-coil and TFIIE during bubble opening. e, Changes near the melting 
start site involving the clamp head, coiled-coil domain, rudder domain and 
TFIIB. f, Changes in Pol II interaction with downstream DNA involving 
the clamp head and RPBS. 


After the BREd, the path of the promoter in the CC deviates from 
B-form DNA due to contact with the clamp and RPB5 around the TSS, 
and further downstream by XPB within TFIH (Fig. 5a). The effect 
of XPB binding can be inferred by comparing the present CC struc- 
ture with our previous cryo-EM structure of core CC in the absence of 
TFIIH"! (Fig. 5a). TFITH causes both a kink at the RPB5 contact, and 
a slight bend of the DNA at the site of contact with XPB (Fig. 5a, b; see 
also Fig. 3f). The PIC conformation maintaining this TFIH-engaged 
DNA state is probably under strain, primed for DNA unwinding by the 
ATPase activity of XPB (see below). 

In the OC cryo-EM map, two continuous densities connecting the 
upstream and downstream duplex DNA correspond to the separated 
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Figure 5 | Nucleic acids rearrangements between stages of transcription. 
a, DNA paths before’' (green) and after (blue/cyan) TFIIH incorporation 
into the CC. XPB is coloured navy blue and the rest of the PIC is shown in 
transparency. b, Section through the CC density map and model showing 
the path of the DNA and its engagement by TFIIH. c, Segmented electron 
microscopy densities and models of promoter DNA in the CC and OC. 

d, Section through the OC density map and model showing the path of the 


DNA strands (Fig. 5c). The template strand follows the previously 
characterized tunnel!® defined by TFIIB, and the wall, rudder, and 
fork loops of Pol II, whereas the non-template DNA is mainly stabi- 
lized by the B-linker and the Pol II fork loops (Fig. 4c and Extended 
Data Fig. 9a, c). Interestingly, while we used an 11 bp mismatch 
DNA scaffold for the OC, from position —9 to +2 relative to the TSS 
(Fig. 1a), incorporation of the DNA into the complex caused a further 
opening of the bubble by two bps, to position —11 (Extended Data 
Fig. 9e). 

The most notable change between the CC and OC structures con- 
cerns the DNA itself (Fig. 4a), beginning ~20 bp downstream of the 
TATA box, where melting is known to start*® (Fig. 5c). The upstream 
DNA barely moves. The location of the 90° bend of the DNA by TBP, 
further stabilized by TFIIA, TFIIB and TFIIF, likely reinforces and 
maintains the relative position of the immobilized upstream DNA 
with respect to Pol II, while a force is applied by TFHH at the down- 
stream end of the promoter. The minor groove of the DNA contacted 
by RPB5 superimposes nearly perfectly for the CC and OC recon- 
structions, whereas the major grooves flanking this point change to a 
different extent (Figs 4f, 5c). In the CC, there is a bend of the DNA at 
the RPB5 contact point, whereas the DNA upstream from that point 
of contact more closely resembles a B-form configuration than pre- 
viously captured in the absence of TFIIH"’ (Fig. 5a). In the OC, the 
double-stranded DNA between the active site and the contact with XPB 
has turned as it slides past the RPB5 contact (Fig. 5c, d). In the process, 
the kink of the DNA by RPB5 disappears and with it probably also 
the tension that such a state may impose on the PIC. In contrast, the 
DNA downstream of the RPB5 contact does not change greatly between 
the CC and OC states, and XPB maintains an unchanged engagement 
with DNA. 

There are no detectable changes in the double-stranded DNA path 
between the OC and the ITC, either upstream or downstream of 
the transcription bubble. The paths of the single-stranded template 
DNA are more similar than those of the non-template DNA (Fig. 6). 
Furthermore, whereas the template DNA in both states is visible as 
a continuous density, there are obvious breaks in the density corre- 
sponding to the non-template strand for the ITC. We propose that 


DNA and its engagement by TFIIH. e, Comparison of the DNA path in our 
ITC(—IIS) state (blue/cyan) with that in the bovine EC structure* (green). 
Segmented electron microscopy density of the EC single-stranded non- 
template DNA is shown to illustrate its path, as this part was not modelled 
in the published structure. f, Close-up views at the active site comparing 
the position of fork loop 2 relative to the nucleic acids and bridge helix in 
the human ITC(—IIS) (top) and bovine EC (bottom). 


such breaks in density might be caused by disorder where scrunching 
is occurring. 

Through all the states analysed, the downstream DNA engaged by 
XPB is in a duplex conformation (Fig. 3f), in agreement with previous 
translocase models for XPB®'“*, The superimposition of the CC and 
OC states shows that XPB must translocate by 12 bp along the DNA to 
position the TSS into the active site of Pol II (Fig. 5c). The same com- 
parison reveals a net rotation of TFIIH by ~10° with respect to the core 
PIC between the two states (Extended Data Fig. 3j and Supplementary 
Video 3). The rotational axis passes through two critical contact points: 
one near the TFIIE-TFIIH interface and the other by the DNA-RPB5 
interaction. TFIIH is flexibly associated to the core PIC in each of the 
states described (Extended Data Figs 2, 3), enabling XPB to reach the 
downstream DNA while TFI[H maintains contact with TFIIE. As men- 
tioned above, initial binding of TFIIH already causes a small rearrange- 
ment of the DNA (Fig. 5a) that may prepare the promoter for melting 
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Figure 6 | Possible region of DNA scrunching during initiation. 

a, b, Segmented cryo-EM densities and models for DNA and RNA near 

the Pol II active site for the OC (a) and ITC(—IIS) (b). TFIIB and Pol II 
loops that are critical for stabilization of the bubble are also shown (density 
omitted for clarity). c, Superimposition of the OC (green) and ITC(—IIS) 
(coloured) states. The lack of visible density (arrow) is interpreted as 
disorder at the site of scrunching, upstream of the contact with the TFIIB 
linker. 
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at the correct position. Pulling of TFHH by this strained DNA results 
in a slightly different trajectory of motion for the CC compared to the 
other states (Extended Data Figs 2, 3). Once the OC state is reached, 
the range of motion of TFIIH changes, and its average position rotates 
as indicated in Extended Data Fig. 3j. Through the transition, TFITH 
would push the DNA past the RPB5 contact, which thus serves as a 
pivot point for both DNA and TFIIH (Supplementary Video 3). Within 
the context of this flexible attachment of TFIIH, the preservation of the 
XPB-DNA contact, at a similar position with respect to the rest of the 
PIC for all the states analysed, is in good agreement with a transloca- 
tion model in which the opening of the transcription bubble occurs 
by XPB threading the DNA into the Pol II cleft, past the RPB5 contact 
(Fig. 5b, d, Supplementary Video 3). 

When we compare our ITC(—IS) with a recent cryo-EM structure 
of a bovine EC*, the RNA-DNA hybrid and the downstream DNA 
superimpose, but there are significant differences for the rest of the 
DNA (Fig. 5e). The upstream duplex DNA in the EC is collapsed by 
10 bp (RNA was normalized to the same length), whereas the path is 
shifted by ~15 A towards TFIIB, to the location originally occupied by 
the N-terminal cyclin fold. This shift would happen at the point when 
the growing RNA has extended to ~12-13 nucleotides* and starts to 
displace TFIIB. Breaking contact with TFIIB, the major connection of 
Pol II with the rest of the PIC, should thereafter allow Pol II to clear 
the promoter. 

The single-stranded DNA follows slightly different paths for the ITC 
and EC, partly because of the notable difference in the upstream duplex. 
Before promoter clearance, extra template DNA has to be fitted within 
the Pol II cleft, probably resulting in scrunching in the ITC*”*. This 
scrunching could explain the differences we observe for the non-tem- 
plate DNA between our ITC and the EC. The conformation of fork 
loop 2 in the EC is identical to what we observe in the OC, ITC and 
ITC(—US) states (Figs 5f, 2b, 4b), indicating that this element, probably 
important for sensing the CC to OC transition by interacting with the 
single-stranded non-template DNA, remains tilted during the OC to 
ITC to EC transitions. 

Our cryo-EM study, which provides nearly complete pseudo-atomic 
models of all the structural elements within a functional human PIC, 
has allowed us to define the structural transitions through the pro- 
cesses of DNA engagement by the CC, opening of the transcription 
bubble in the OC, and initiation of transcription in the ITC. The pres- 
ent structures constitute a comprehensive structural framework for past 
and future studies of the complex process of eukaryotic transcription 
initiation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
Transcription complex assembly and purification. TBP, TFIIA, TFIIB, TFIIE, 
TFIF and TFIIS were recombinantly expressed and purified from Escherichia coli. 
Pol II and TFIIH were immunopurified from HeLa cell nuclei (that tested negative 
for mycoplasma) following previously established protocols°®°'. The design of 
the DNA construct was based on the super core promoter”, except that a BREu 
element was introduced upstream of the TATA box*® and an EcoRI restriction 
enzyme site was included downstream of the INR element for purification purposes 
(template, 5’-ACTGGGGAAT TCCATGGTCCGTAGGCACGTCTGCTCGGCT 
CGAGTGTTCGATCGCGACTGAGGACGAACGCGCCCCCACCCCCTTTTA 
TAGGCGCCCTTC-3’; non-template 1, 5’- GAAGGGCGCCTATAAAAGGGG 
GTGGGGGCGCGTTCGTCCTCAGTCGCGATCGAACACTCGAGCCGAG 
CAGACGTGCCTACGGACCATGGAATTCCCCAGT-3’). The nucleic acid 
scaffolds that were used to generate the PICs in the open and initial transcribing 
conformation were designed by modification of the promoter substrate used to 
form the closed PIC. 11 bp or 19 bp mismatch sequences composed of poly-T were 
introduced to mimic the different sizes of initiation bubbles in the open and initial 
transcribing conformation during promoter opening, respectively (Fig. 1, top). 
Though the mismatch bubble stops at —9 in the open conformation, PIC assembly 
further unzips two extra base pairs at the upstream end of the bubble. Mismatch 
bubbles that stop at —11 and —7 were tested in the ITC reconstruction. The latter 
scaffold did not generate a stable complex, indicating that the GIFs can open up 
two, but not four, base pairs in the context of the core PIC. An RNA-DNA duplex 
beyond 7 bp or 12 bp has been proposed to be the trigger for TFIIB release and 
promoter escape®!®!8-46°3, Thus, we introduced a 6 nucleotide (nt) RNA primer 
complementary to the template strand, starting at the TSS used in our study 
(non-template 2, 5’- GAAGGGCGCCTATAAAAGGGGGTGGGGGCGCGAAT 
TTTTTTTTTCGCGATCGAACACTCGAGCCGAGCAGACGTGCCTACGGA 
CCATGGAATTCCCCAGT-3’; non-template-3, 5’-GAAGGGCGCCTATA 
AAAGGGGGTGGGGGCGTTTTTTTTTTTTTTTTTTTTCGAACACTCGA 
GCCGAGCAGACGTGCCTACGGACCATGGAATTCCCCAGT-3’; RNA, 
5’-AGUCGC-3’). A biotin tag was engineered at the 5’ end of the template strand 
(Integrated DNA Technologies). The duplexed DNA was generated by annealing 
the template strand with equimolar amounts of single-stranded non-template 
DNA at a final concentration of 501M in water. The annealing reaction was 
carried out at 100°C for 5 min and gradually cooled down to room temperature 
within 2h. 

PIC in the closed conformation was assembled as initially described", except 
for an additional incubation of TFIIS at a final concentration of 200 nM with the 
purified PIC before application to the EM grid'*. PICs in the open and initial tran- 
scribing conformation were assembled in a similar fashion, but using the appro- 
priate nucleic acid scaffolds instead of DNA duplex. Purified PIC complexes were 
crosslinked after elution by incubation with glutaraldehyde at a final concentration 
of 0.05%, on ice and under very low illumination conditions, for 5 min, then imme- 
diately used for cryo-EM sample preparation. 

Electron microscopy. Preparation of PIC samples for cryo-EM observation was 
carried out using 400 mesh C-flat grids containing 4 |1m holes with 4\1m spacing 
(Protochips). A thin carbon film was floated onto the grid before it was plasma 
cleaned for 5s using a Solarus plasma cleaner (Gatan) under a 75% argon/25% 
oxygen gas mix, immediately before sample deposition. An aliquot (3 yl) of the 
purified sample (~100 nM) was placed onto the grid and loaded into a Vitrobot 
(FEI) operated at 100% humidity and 4°C. The sample was allowed to absorb for 
5 min (under low illumination conditions), then was blotted for 4 s and imme- 
diate plunged into liquid ethane. The frozen grids were stored in liquid nitrogen 
until loaded into a Titan electron microscope (FEI) operating at 300 keV using a 
626 single-tilt cryotransfer system (Gatan). Data were acquired with a K2 sum- 
mit direct electron detector (Gatan) operating in counting mode at a calibrated 
magnification of 37,879 x (1.31 A per pixel), using a range of defocus values (from 
—2 to —4y.m). Between 855 and 1334 movie series for each of the cryo-EM data 
sets were collected using the MSI-T application of the Leginon data collection 
software**. 30-frame exposures were taken at 0.3 s per frame (9 s total exposure 
time), using a dose rate of 8e” per pixel per second, corresponding to a total dose 
of 42 e~ A~? per movie series. 

Image processing. Movie frames were aligned using MotionCorr to correct for 
specimen motion?’. The average of the aligned frames was used for data pre- 
processing within the Appion processing environment”*. Particles were automati- 
cally selected from the aligned micrographs using a difference of Gaussians (DoG) 
particle picker®’. For CC, OC, ITC and ITC(—IIS), 245,501, 328,720, 366,145, and 
434,396 particles were picked from 855, 966, 1099, and 1,344 micrographs, respec- 
tively (Extended Data Figs 2-4). The contract transfer function (CTF) of each 
micrograph was estimated using the CTFFind3 program”® during data collection. 


All three-dimensional (3D) classification and refinement steps were performed 
within RELION” (version 1.4-beta). 

Three-dimensional reconstruction. For the CC, the initial set of 245,501 particles 
was subjected to an initial 3D classification, using the negative stain reconstruction 
of a similar complex"! (in which the nucleic acid is not visible and TFIIS is absent) 
low-pass filtered to 60 A as the initial reference (Extended Data Fig. 2c). Because 
of the use of such reference, the presence of density for the DNA and for TFHS in 
the reconstruction can be used as clear signs that features are not arising owing 
to noise alignment to the references. One out of five classes in this classification, 
corresponding to 34,728 particles, was indicative of well-preserved holo-complex 
with sharp structural feature and was selected for further processing. This particle 
set was then subject to 3D auto-refinement within RELION, resulting in a recon- 
struction of the CC at an overall resolution of 7.2 A (Extended Data Fig. 2b). All 
resolutions reported herein correspond to the gold-standard Fourier shell corre- 
lation (FSC) using the 0.143 criterion®. Local resolution estimation indicated that 
the density for TFIIH core (the kinase module is too flexible to be visualized) was 
at lower resolution than the PIC core containing the rest of the proteins, probably 
owing to conformational heterogeneity (Extended Data Fig. 2c). Indeed, further 3D 
classification within RELION successfully separated the data set into two, resulting 
in two distinct locations of the TFIIH relative to the PIC core (Extended Data Fig. 2c). 
This flexibility of a major part of the complex would also likely have contributed 
to the limited resolution of the rest of the CC reconstruction. Therefore, a soft 
mask was applied around the PIC core density (that is, excluding TFIIH) during 
further 3D refinement within RELION, thus effectively excluding the contribution 
of the TFIIH signal from the final alignment rounds. This procedure resulted in 
an improved reconstruction of the PIC core, with an overall resolution of 5.4A 
(Extended Data Fig. 2b, c). 

For the OC, the initial set of 328,720 particles was subjected to an initial 3D 
classification, similar to the above CC (Extended Data Fig. 3c). One out of five 
classes in this classification, corresponding to 59,271 particles, was indicative of 
a well-preserved holo-complex and was selected for further processing. This par- 
ticle set was then subject to 3D auto-refinement within RELION, resulting in a 
reconstruction of the OC at 8.6 A resolution (Extended Data Fig. 3b). Further 
3D classification within RELION also successfully separate the data set into two, 
resulting in two distinct locations of TFIIH relative to the PIC core (Extended 
Data Fig. 3c). However, the movement trajectory of TFIIH found in the OC was 
different from the one found in the CC. The same initial data set was also used to 
derive the reconstruction of the PIC core using the negative stain reconstruction 
of a similar complex"! (in which the nucleic acid is not visible and TFIIS is absent) 
low-pass filtered to 60 A as the initial reference (Extended Data Fig. 4c). One out of 
five classes in this classification, corresponding to 79,849 particles, was indicative 
of well-preserved full core complex and was selected for further processing. This 
particle set was then subject to 3D auto-refinement followed by particle polishing 
procedure within RELION® in order to correct for individual particle motion and 
beam-induced radiation damage of the sample. Dose-dependent relative B-factor 
was estimated per frame and applied during the particle polishing step (Extended 
Data Fig. 4a). This resulted in a reconstruction of the OC in the absence of TFUH 
at an overall resolution of 3.9 A (Extended Data Fig. 4b). Similar procedures were 
taken for processing the data set for the ITC and ITC(—US) (Extended Data 
Figs 3, 4). 

For the TFIIH core complex, three data sets corresponding to the OC, ITC and 
ITC(—IIS) were merged together following the initial 3D classification within each 
data set giving rise to the holo-PIC (Extended Data Fig. 6a). A similar approach to 
that used for deriving the PIC core within the CC was taken by applying a soft mask 
around the TFIIH core density during further 3D refinement within RELION, 
effectively excluding the contribution of core PIC signal during the alignment. 
This procedure resulted in an improved reconstruction of the TFIIH core, 
with clear observation of the major and minor grooves of the bound duplex DNA 
(Fig. 3f and Extended Data Fig. 6a). 

Local resolution calculation was performed for all reconstructions using the 
‘blocres’ function in the Bsoft package®® (Extended Data Figs 2-4, 6a). The final 
density maps were automatically sharpened using the post-processing program 
within RELION and then filtered according to local resolution using the “blocfilt’ 
function within Bsoft. Volume segmentation, automatic rigid-body docking, figure 
and movie generation were performed using UCSF Chimera™. 

Building initial PIC atomic models. To interpret the structural mechanism in 
more detail, four models of the PIC (CC, OC, ITC and ITC(—IIS)) in different 
functional states were built. We first generated an initial homology model of human 
RNA polymerase II (Pol II) in complex with TFIIB using MODELLER®. We used 
the sequences of the twelve Pol II subunits (RPB1 to RPB12) and TFIIB without 
the C-terminal cyclin fold in the crystal structure of yeast Saccharomyces cerevisiae 
Pol II-TFIIB complex (PDB: 4BBR)'” as a template. This homology model for Pol 
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II-TFIIB was placed as a rigid body into the density map using UCSF Chimera™. 
The RPB4 and RPB7 subunits were then replaced by the X-ray structure of human 
RPB4 and RPB7 heterodimer (PDB: 2C35)°”. Analysis of the fit showed that the 
B-linker helix of yeast TFIIB overlapped with the DNA in the cryo-EM reconstruc- 
tion and that it required to be retraced as a loop region. 

The crystal structure of TBP, TFIIB and upstream core promoter DNA duplex 
(PDB: 1C9B)** was placed and rigid-body fitted into the map by aligning TFIIB 
to the TFIIB’s N-terminal cyclin-fold domain. We then extended the model to 
include TFIIA, by superimposing TBP on the TBP-TFIIA-DNA crystal structure 
(PDB: INVP)*!. 

The crystal structure of the human TFIIF dimerization domain (PDB: 1F3U)® 
was docked into the density ascribed to TFIIF by the lobe domain of Pol II. Our 
rigid body fitting of crystal structures results in a small clash between the RPB2 
lobe and the RAP74 C-terminal «1 helix, a region shown to be important in both 
transcription initiation and elongation. RAP74 al helix was rigid-body adjusted 
to better fit the density and extra residues (residues 167-180) in the C terminus 
were included based on the results of secondary structure prediction®®”’. In addi- 
tion to interacting with Pol II, it is evident from the cryo-EM density that TFIIF 
directly contacts the BREd DNA. We previously proposed" that this region corre- 
sponds to the C-terminal winged-helix (WH) domain of RAP30 (PDB: 1BBY)”', 
and we can now dock it with high accuracy into our improved map. TFIIF also 
contributes to the extra density seen linking the C-terminal WH domain and the 
dimerization domain near the protrusion of Pol II right below the central cleft. We 
built the path of RAP30, N-terminal to the WH domain on the basis of secondary 
structure prediction ”® and fitted it into the electron microscopy density. Two 
different protein secondary prediction servers (PSIPRED and SABLE) were used 
for cross-validation purposes”, 

To model the human TFIE WH domains, residues 11-101, corresponding to 
the TFHEa WH, were aligned with Sulfolobus solfataricus TFUE WH residues 
7-88 (PDB: 1Q1H)”. TFIIE8 WH2 residues 150-207 were aligned with Sulfolobus 
tokodaii STO12A residues 25-106 (PDB: 2D1H)’*. The human ZR domain, 
residues 125-164 (PDB: 1VD4)”4, was docked near the base of the stalk domain of 
Pol II. The human TFIJE8 WHI residues 75-146 (PDB: 1D8J)”° were fitted into 
the region proximal to TFIIF RAP30 WH domain. The individual TFHE WH 
domain structures were oriented relative to each other by rigid-body fitting into the 
electron microscopy density. The TFIIEa residue segments 102-124 and 165-204 
and the TFIJES residues 205-239 were modelled on the basis of the observed 
electron microscopy density and consensus secondary structure prediction®””. 
Missing loops and the linker between the WH domains of TFIE were built with 
the program Modeller®®°. 

The X-ray structure of human TFIIS domain II (PDB: 3NDQ) and NMR struc- 
ture of the TFIIS zinc-finger motif (PDB: 1TFI)’° were combined and rigid-body 
fit into the density for the core of Pol H. The linker between domain II and the 
zinc-finger motif was constructed from the yeast TFIIS structure (PDB: 1Y1V)*° 
by homology modelling and fit into the density. 

To model TFHH, we used the crystal structure of the Archaeoglobus fulgidus 
XPB N-terminal domain (PDB: 2FZ4)”’ as a template to build the corresponding 
N-terminal portion of human XPB. The C-terminal domain of human XPB (PDB: 
4ERN)’® was then directly used to fit into our density map as a rigid body before 
connecting with the N-terminal part. We generated a homology model for the 
XPD on the basis of the Thermoplasma acidophilum XPD structure (PDB: 2VSF)”’. 
The N-terminal helix was added by using Sulfolobus acidocaldarius XPD (PDB: 
3CRV)*°. The X-ray structure for Ssl1/P44 from Saccharomyces cerevisiae (PDB: 
4WFQ)®! was used as a template to construct the human P44 structure. We used 
the crystal structure of a minimal complex between Tfb5, the yeast orthologue 
of P8, and the C-terminal domain of Tfb2, the yeast P52 subunit of TFIIH (PDB: 
3DOM)*, as a template to build the human P52/P8 structure. The human structure 
of P52/P8 was then docked into a corresponding density adjacent to the C-terminal 
half of XPB. In order to remove the clashes between the N-terminal helix of P52 
and XPB, we rigid-body adjusted the position of this helix to better fit the density. 
The structure of the N terminus of p52/Tfb2 is predicted to be similar to the HEAT 
repeats of human transportin 3. We used the human X-ray structure of trans- 
portin 4 (PDB: 4C0O)* as a template to model the human P52 N-terminal heat 
repeat. The amino acid sequence of human P52 was aligned with that of human 
transportin 4, initially with the PROMALS3D multiple sequences and structure 
alignment server*, and then adjusted manually. The human p44 VWA domain was 
modelled using the X-ray structure of the p34 subunit of the TFITH complex from 
the eukaryotic thermophilic fungus Chaetomium thermophilum (PDB: 4PN7)*. 

The upstream core promoter DNA elements within the OC, ITC and ITC-TFIIS 
complexes remained bound by TBP, TFHA and TFIIB, in the same configuration 
as in the CC. The bent DNA model in the CC was generated using the 3D-DART 
online server and the 3DNA software package®®. To model downstream DNA, 
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the DNA-RNA hybrid structure from a previous model of the budding yeast 
elongation complex (PDB: 4A3F, 4A3D and 4A3I)° was fit into the density and 
adjusted. The downstream DNA was extended by adding duplex DNA in each 
model. Individual nucleotides in the DNA bubble were adjusted to best represent 
the observed electron microscopy density in this region of the cryo-EM maps. The 
sequences of the DNA substrates (in the electron microscopy experiments and 
the simulation) are shown in Fig. la. These initial models (for core CC, OC, ITC, 
ITC(—HS), and for TFIIH) were subjected to electron microscopy flexible fitting 
and refinement of atomic positions as described in the next section. 

Molecular dynamics flexible fitting and refinement in Phenix. To better capture 
the conformational changes revealed by the cryo-EM maps in different states, a 
molecular dynamics flexible fitting procedure (MDFF) was used to flexibly fit the 
models into the EM density*”. MDFF is a method wherein external forces propor- 
tional to the electron microscopy density gradient are applied, biasing the motion 
of the protein atoms towards the high-density regions of the map. Additional 
restraining forces were also applied during fitting to prevent structural distor- 
tions and preserve the secondary structure elements. Hydrogen atoms, counterions 
(Na*) and TIP3P solvent were introduced using the Tleap module in AMBER10®. 
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Extended Data Figure 1 | Purification and activity of human PIC. a, SDS-PAGE (4-12% gradient gel followed by silver staining) of purified 
transcription factors and assembled PIC on a promoter DNA. b, The purified PIC in a was ran on a 10% TBE-urea gel after supplying ribonucleoside 
triphosphates in a run-off reaction. kDa, kilodaltons. 
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Extended Data Figure 2 | Cryo-EM of the human CC. 

a, Representative raw micrograph. b, FSC curve and estimated 
resolution using the 0.143 criteria following the gold-standard procedure 
implemented in RELION for both the holo-complex and the PIC core. 

c, Refinement strategy for the holo-complex (see Methods). The local 
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complex. 
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Extended Data Figure 3 | Cryo-EM of the human OC/ITC/ITC(—IIS) 
complexes. a, d, g, Representative raw micrograph. b, e, h, FSC curve 
and estimated resolution using the 0.143 criteria following the gold- 
standard procedure implemented in RELION. ¢, f, i, Refinement strategy 
for the holo-complex (see Methods). The local resolution estimation 
shows flexibility for TFIIH. Further 3D classification revealed the range 
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of motion of TFIIH within the complex (pink and blue densities). Notice 
that the direction and range of motion for these three states is similar. 
Compared to the CC, the pink densities are approximately the same, but 
the direction of motion from pink to blue has changed. j, Movement of 
TFIIH with respect to the core PIC from the CC to OC state. 
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Extended Data Figure 4 | Cryo-EM of the core human OC/ITC/ITC(— IIS) 
complexes. a, d, g, Dose-dependent B-factor plot from RELION. 

b, e, h, FSC curve and estimated resolution using the 0.143 criteria 
following the gold standard procedure implemented in RELION. 

c, f, i, Refinement strategy (see Methods). The local resolution estimation 
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Extended Data Figure 5 | Segmented cryo-EM densities and models 
for different regions of interest within the human core PIC in the CC, 
OC, ITC and ITC(—IHIS) states. a, TFIIB and its interaction with DNA. 
b, Extension of TFIIA stabilized by interaction with TBP. c, Trigger loop 
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ITC(-IIS) 
in Pol II, viewed in an ‘off’ state by comparison with crystallographic 
structures of yeast Pol II (2NVZ*° in blue and 1Y1V” in green). d, TFIIE 
and its interacting partners with the PIC. e, TFIIF and its interacting 
partners with the PIC. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Cryo-EM reconstruction and interpretation 
of human TFIIH. a, Refinement strategy (see Methods) for the seven 
subunit TFIIH core complex (the CAK subcomplex is too flexible to be 
visualized following averaging). The local resolution estimation for the 
PIC structure obtained by combining the OC, ITC and ITC(—HS) data 
sets shows lower resolution for TFIIH, reflecting its flexible attachment. 
Focused refinement (using the mask marked by dashed lines) allowed us 
to improve the resolution for TFILH. b-d, Comparison of human XPB 
and the ssoRad54 ATPase structures and their interaction with DNA. 

b, Human XPB-DNA model. N terminus is shown in navy blue, C 
terminus in pink and DNA in cyan. ¢c, Crystal structure of ssoRad54 
ATPase (PDB: 1Z63)°?. The N terminus is shown in yellow, the C terminus 
in green and the DNA in cyan. Whereas the N-terminal domains of 


ARTICLE 


both proteins and the DNA can be easily superimposed, the C-terminal 
domains are in very different position. d, A 123° rotation would be 
required to superimpose the C-terminal domains around an axis located 
on the first residues (top, shown in spheres) that connects to the fixed 
N-terminal domain. The rotation axis and planes are coloured in red. 

e, Possible location of the TFIEa-TFIIH/p62 interface. An unassigned 
density in the region of proximity/contact between TFIIE and TFUH 
probably corresponding to the TFIIE-TFIIH interface is marked with 
boxes for each reconstruction. A tentative orientation of the NMR 
structure for a short C-terminal segment of TFIIEa bound to the PHD 
domain of p62 (PDB: 2RNR)” is proposed in the centre for that flexible 
density region. 
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Extended Data Figure 7 | Position of the TBP-IIA-IIB module and of a recently reported cryo-EM structure of yeast CC (green)’* and the 
mobile elements of Pol II. a-~c, Comparison between a ‘synthetic’ human CC in this study. The structures were aligned using the rigid part of 
structure of the CC, generated by superimposing human TBP-TFIIA-DNA Pol II (that is, excluding the clamp and the stalk). The yeast TBP-TFIIA- 
(PDB: INVP)?!, human TBP-TFIIB-DNA (PDB: 2C9B)°®, and, most TFIIB-DNA module and the mobile regions of Pol II (clamp and stalk) are 
notably, the yeast Pol II-TFIIB (PDB: 4BBR)'° using common elements in different relative positions. Whereas the TBP-TFIIA-TFIIB module and 
(green), and the human cryo-EM CC model (coloured). A number of the clamp and stalk element resembles those in the ‘synthetic’ model (a-c), 
elements are rotated between the two: the TBP-IIA-IIB-DNA subcomplex the path of the DNA is very different, in that it moves away, rather than 
(a), the Pol II clamp (b), and the Pol II stalk (c). The rotation plane and towards the Pol II stalk. 


angle are depicted in red. d, Comparison, shown in two different views, 
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Extended Data Figure 8 | Comparison of eukaryotic and bacterial a very similar manner to that observed in our OC, ITC and ITC(—HS) 
initiation complexes around the active site. a, b, Equivalent close-up structures (see Figs 2b and 5f). Domain 2 within the bacterial o factor is 
views of our ITC(—IIS) model (a) and the crystallographic structure of involved in stabilizing the non-template DNA in a similar manner as the 
a bacterial initiation complex (4G70”%) (b). The fork loop 2 is tilted in TFIIB linker region in human initiation complex. 
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Extended Data Table 1 | Statistics after refinement of atomic coordinates with Phenix for the core PIC models 
(OC, ITC, ITC(—IIS) and CC), and scores from MolProbilty structural analysis 
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Interacting compact binary systems provide a natural laboratory 
in which to study irradiated substellar objects. As the mass-losing 
secondary (donor) in these systems makes a transition from the 
stellar to the substellar regime, it is also irradiated by the primary 
(compact accretor)'*. The internal and external energy fluxes 
are both expected to be comparable in these objects, providing 
access to an unexplored irradiation regime. The atmospheric 
properties of donors are largely unknown’, but could be modified 
by the irradiation. To constrain models of donor atmospheres, it 
is necessary to obtain accurate observational estimates of their 
physical properties (masses, radii, temperatures and albedos). 
Here we report the spectroscopic detection and characterization 
of an irradiated substellar donor in an accreting white-dwarf binary 
system. Our near-infrared observations allow us to determine a 
model-independent mass estimate for the donor of 0.055 + 0.008 
solar masses and an average spectral type of L1 + 1, supporting 
both theoretical predictions and model-dependent observational 
constraints that suggest that the donor is a brown dwarf. Our time- 
resolved data also allow us to estimate the average irradiation- 
induced temperature difference between the dayside and nightside 
of the substellar donor (57 kelvin) and the maximum difference 
between the hottest and coolest parts of its surface (200 kelvin). The 
observations are well described by a simple geometric reprocessing 
model with a bolometric (Bond) albedo of less than 0.54 at the 2c 
confidence level, consistent with high reprocessing efficiency, but 
poor lateral heat redistribution in the atmosphere of the brown- 
dwarf donor*». These results add to our knowledge of binary 
evolution, in that the donor has survived the transition from the 
stellar to the substellar regime, and of substellar atmospheres, in 
that we have been able to test a regime in which the irradiation and 
the internal energy of a brown dwarf are comparable. 

So far, only a single brown-dwarf donor to an accreting white dwarf 
has been detected spectroscopically*, and no empirical information is 
available regarding the effect of irradiation on the atmosphere of the 
donor. We have therefore carried out simultaneous optical and 
near-infrared time-resolved spectroscopy of one such system with the 
X-Shooter instrument® at the Very Large Telescope (European 
Southern Observatory, ESO). Our target, SDSS (Sloan Digital Sky 
Survey) J143317.78+101123.3 (J1433 hereafter; ref. 7), is an eclipsing 
system with a short orbital period (Po, = 78.1 min) and a donor mass 
that is probably well below the hydrogen-burning limit, as estimated 
from eclipse modelling (M2 = (0.0571 + 0.0007) Mo; refs 8, 9). The 
exceptionally wide wavelength coverage provided by our data set 
(0.35-2.5 1m) allows us to confidently dissect the overall spectral 
energy distribution. This is illustrated in Fig. 1. Disk-formed, dou- 
ble-peaked hydrogen emission lines can be seen across the entire spec- 
tral range. The presence of broad Balmer absorption lines between 
0.3 4m and 0.5\1m shows that the white dwarf is the strongest emitter 


in this wave band. However, the spectrum of the donor star is clearly 
visible in the near-infrared and dominates between 1.0,1m and 2.5 um. 

The clear donor signature in the near-infrared allows us to isolate 
the spectrum of the donor and determine its properties. We decom- 
posed the average spectrum by fitting the optical region with a white- 
dwarf atmosphere model"? and an accretion disk (modelled as a simple 
power law). Both of these contributions were then extrapolated to 
the near-infrared and subtracted to retrieve a pure donor spectrum. 
This donor spectrum was then compared to an empirical brown-dwarf 
spectral sequence!’. The best-matching spectral type was found to be 
SpT = L1 = 1, which agrees remarkably well with semi-empirical evo- 
lutionary predictions’” and with previous narrow J-band estimates’. 

These results imply that the donor in J1433 has successfully under- 
gone the transition from the stellar to the substellar regime. The 
alternative—that the secondary was born as a brown dwarf—is 
extremely unlikely. First, extreme-mass-ratio binary systems containing 
very-low-mass objects (M <0.1Mo) are intrinsically rare’*. Second, the 
mass of the accreting white dwarf in J1433 (Mwp = (0.80 + 0.07)Mo) 
is exactly in line with the mean white-dwarf mass of known accreting 
white-dwarf systems going through the standard evolution channel’, 
but much higher than the average mass of isolated white dwarfs 
((Mwp) *0.6M; ref. 13) and that of any known primary in a white- 
dwarf-brown-dwarf binary system". Third, the orbital period of J1433 
lies squarely within the ‘period minimum spike’ that coincides with the 
stellar-to-substellar transition of the donor star in systems born with 
main-sequence companions’. By contrast, systems born with substellar 
donors would be expected to populate a wide range of orbital periods 
below this spike!®. However, no such systems have been found so far. 
Indeed, the only accreting white dwarf with a substellar donor and an 
orbital period well below the period minimum has been shown to bea 
low-metallicity object in the Galactic halo’’, rather than a system born 
with a brown-dwarf secondary. 

Models of irradiated planets suggest that irradiation can increase 
photospheric temperatures by an order of magnitude compared to an 
isolated object, with substantial effects on the radius and atmospheric 
structure of the planet!®. Furthermore, a long-standing mismatch 
between the predicted and observed minimum period for accreting 
white-dwarf binary systems can be traced to an approximately 10% 
offset between the donor radii inferred from observations and those 
predicted by theoretical model atmospheres’. However, such large 
effects are only possible if the irradiating flux is efficiently absorbed 
in the atmosphere and redistributed across the terminator, that is, 
from the dayside to the nightside*”’. This is necessary because it is 
the blocking effect of the incoming radiation on the outward heat flux 
from the interior of the secondary that drives the irradiation-induced 
swelling of the donor (secondary). 

Globally, the bolometric flux emitted by the irradiated donor must 
always balance the sum of its intrinsic flux and the irradiation flux it 
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Figure 1 | The average optical and near-infrared spectral energy 
distribution of SDSS J143317.78+101123.3. a, The overall fit to the 
spectra (black line) using a white-dwarf atmosphere model (light blue line) 
and brown-dwarf templates (yellow line) was performed. The contribution 
from the accretion disk (dark blue dashed line) was described as a simple 
power law with an index a = —7/3. We obtained a best fit (red line) with a 
spectral type of SpT = L1 + 1. GALEX FUV, NUV (blue stars), SDSS (green 
line, purple circle), UKIDSS (orange squares) and WISE (green diamonds) 


has absorbed?!. However, in the absence of redistribution, this balance 
must be maintained locally in the atmosphere. This should produce 
pronounced temperature differences between the dayside and night- 
side on the tidally locked donor. Because the system is seen close to 
edge-on, such differences should manifest as changes in the apparent 
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Figure 2 | Ellipsoidal and irradiation effects associated with the 
substellar donor in SDSS J143317.78+101123.3. We find a difference 
in spectral type between the dayside and nightside hemispheres of 
ASpT * 1, equivalent to AT ~ 57 K with a maximum temperature 
difference of approximately 200 K. a, Temperature distribution of the 
atmosphere of the donor as a function of phase. b, 1.3-11m water-band 


photometry are shown for reference; vertical error bars represent the lo 
confidence intervals and horizontal error bars represent the width of the 
filter. b, Trailed, phase-binned spectra around the K 1 doublet at 1.2436 1m 
and 1.2528 \.m. These absorption features arise in the atmosphere of the 
donor star (greyscale), enabling the determination of orbital parameters 
such as the semi-amplitude K) = 499 + 15km s“|, the rotational 
broadening of the donor vsini= 131 -+46km s“' and the systemic velocity 
y= —42+8kms"! (red line). 


spectral type of the donor as a function of orbital phase. We have tested 
for this by measuring the HO 1.3-.m-band index'!. Water vapour is 
the most sensitive absorber in irradiated hot Jupiters”*, and water- 
band strength has been shown to correlate well with spectral type and 
effective temperature’’. As shown in Fig. 2b, we detect a spectroscopic 
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depth, (F)=1.318um)/(F)=1.340 4m), Measurements (circles) with the best-fit 
model (solid line) and a model with no irradiation source (dashed line) 
for comparison. c, Timed-resolved J-band photometry (circles) after 
subtracting the contributions from the white dwarf and the accretion disk. 
Lines represent the same models as in b. Error bars in b and ¢ represent lo 
confidence intervals. 
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change of ASpT ~ 1 (that is, M9 to LO), with a maximum at phase of 
0.5, precisely where the dayside of the donor is exposed. 

On its own, this spectroscopic signal would be only marginally sig- 
nificant, because a model without irradiation can be ruled out at only a 
20 confidence level. However, an independent irradiation signature is 
provided by the observed broadband flux variations around the orbit, 
which are shown in Fig. 2c. This figure reveals a double modulation 
over one orbit, which is produced by the combination of two effects: 
the distorted shape of the tidally locked, Roche-lobe-filling donor star 
(ellipsoidal variations) and the temperature difference between its 
dayside and nightside (irradiation effect). To quantify the irradiation 
effect, we used the binary light curve synthesis code ICARUS”? to fit 
both the water-band and broadband modulations under the assump- 
tion of no heat redistribution between hemispheres. As shown in Fig. 2, 
the resulting fit is acceptable and provides a plausible representation 
of the temperature distribution across the surface of the donor. The 
average temperature of the dayside is (Tjay) = 2,401 + 10 K and of the 
nightside is ( Thight) = 2,344 + 7 K. These numbers are consistent with 
the mean donor temperature obtained from the template-calibrated 
spectral-type measurements. 

The observed difference between the dayside and nightside fluxes is 
statistically significant and allows us to estimate the reprocessing effi- 
ciency of the donor. This efficiency is usually parameterized via the bolo- 
metric (Bond) albedo Ag, which is the fraction of the incident irradiation 
flux that is not reprocessed, but reflected back into space. The irradiating 
source in the system is the hot white dwarf. We therefore obtained an 
improved measurement of its temperature, Twp = 13,200 + 200K, by 
making use of ultraviolet observations obtained by the GALEX satellite. 
With this, we find Ap < 0.54 at 20 confidence, as shown in Extended 
Data Fig. 1. This low value implies a high reprocessing efficiency in 
the atmosphere of the donor. We can also use the observed difference 
between dayside and nightside fluxes to estimate the efficiency € with 
which radiation absorbed locally is redistributed across the donor (see 
Methods). We find a limit for the redistribution efficiency of € < 0.54 at 
20 confidence. Efficient reprocessing, coupled with poor heat redistri- 
bution, has also been found in hot Jupiters in this atmospheric temper- 
ature regime” and been suggested for low-mass stars in accreting white 
dwarfs". If irradiation is the main cause of the larger-than-predicted 
donor radii)’, then efficient heat redistribution is required. Therefore, 
our finding that the heat transfer from the dayside to the nightside 
must be modest suggests that irradiation is unlikely to be responsible 
for inflating the donors. Instead, the dominant effect of irradiation is 
simply an increase in the local temperature’. 

In most hot Jupiters, the external irradiation overwhelms any 
internal heat flux. The relatively milder irradiation experienced by 
the donor in J1433, coupled with the unique prospect of obtaining 
detailed phase-resolved spectra and line profiles, will permit quanti- 
tative tests of irradiated model atmospheres in a regime where these 
observables are expected to be quite sensitive to irradiation. The sys- 
tem also provides an independent benchmark for theoretical models 
that predict the albedo and heat redistribution efficiency in irradiated 
planetary atmospheres”. Finally, the donor in J1433 is a rapid rotator 
(with a projected rotational velocity of 131+46km s~'), which could 
have interesting and substantial effects on its atmospheric dynamics”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Mass determination. We performed a radial velocity analysis on the 1.243-j1m 
and 1.252-\1m K 1 absorption lines formed in the atmosphere of the donor. 
The trailed spectrum of these features is shown as the inset in Fig. 1. We per- 
formed a simultaneous fit to both lines and found the best values for the semi- 
amplitude K, =499 + 15km s“', the systemic velocity y= —42-+8km s_! and the 
rotational broadening vsini= 131 + 46km s~!. When combined with the observed 
value for the radial velocity semi-amplitude of the white dwarf, K, =34+4kms"! 
(ref. 25), these best-fit values allow us to derive a purely dynamical estimate of 
the mass ratio of the system: gq = M>/M, = K,/K, = 0.068 + 0.008. For this mass 
ratio, the occurrence of white-dwarf eclipses alone sets a lower limit on the 
inclination of i> 80° (ref. 26). We can then use the mass function of the secondary, 
f (Mz) = PxpK?/(20G) = Mysin*i/(1 + q)? where G is the gravitational constant, 
to set a purely dynamical 20 upper limit on the mass of the donor star, 
M) <0.071Mo. We can also use the mass function to obtain a model-independent 
estimate of the donor mass. In any semi-detached compact binary system, there 
is a unique, single-valued family of i-q pairs that produce a compact object eclipse 
with a given width”*. In the case of J1433, the width of the white-dwarf eclipse can 
be measured directly from published optical light curves'’. We can then combine 
the resulting i-q constraint with our spectroscopic mass ratio and the mass func- 
tion to obtain a robust estimate of the donor mass that depends only on Kepler’s 
third law and the well-understood geometry of Roche-lobe-filling objects. We 
find Mz = (0.055 + 0.008) Mo, well below the hydrogen-burning limit. 
Water-band ratios and broadband fluxes. To provide the cleanest measurements of 
the donor at any given phase, we subtracted contributions from both the white dwarf 
and the accretion disk. The flux observed from the white dwarf was assumed to be 
constant in phase, except for phase bins affected by an eclipse. In these, we reduced 
the flux by the fraction of the bin width during which the white dwarf was occulted. 
The contribution from the accretion disk was modelled by fitting a power law to the 
optical (3,200-5,400 A) wave band (while masking the emission lines) and extrapo- 
lating to the near-infrared. Owing to the low signal-to-noise ratio of the individual 
spectra, the telluric removal created artefacts in some regions of the spectra. We 
identified these regions and masked them before shifting each spectrum to the rest 
frame of the donor. We binned the data into eight orbital phase bins and measured 
a slightly modified water-band index, defined as (F)\ =1.318m)/(F\=1.340,m)> Where 
the fluxes F) are averages over a 5-A window. This index was then calibrated against 
brown-dwarf spectral-type templates''. Errors were estimated via 500 bootstrap 
copies of the median flux inside every orbital bin for all the spectra. The broadband 
flux modulation was estimated by integrating the donor flux over the J band (after 
masking all emission lines) and converting to AB magnitudes. 

Dayside and nightside temperatures, albedo and intrinsic luminosity. To model 
the orbital-phase-dependent water-band ratios and broadband flux, we constructed 
a grid of irradiated donor models with ICARUS”. Each of these models is charac- 
terized by two parameters: the irradiating luminosity absorbed by the donor and 
the intrinsic luminosity it would produce in the absence of irradiation. For each 
parameter pair, we use ICARUS to calculate the resulting temperature distribution 
across the donor and to predict the observed spectrum at each orbital phase. These 
calculations assume that all of the irradiating flux absorbed at a given point on the 
surface of the donor is re-emitted locally and neglect any effect of the fast rotation 
of the donor on the temperature distribution (which could affect the atmospheric 
dynamics”). However, the distorted shape of the Roche-lobe-filling donor, as well 
as limb- and gravity-darkening, are fully taken into account in these calculations. 
The absorbed irradiating and intrinsic luminosities are not directly observable, 
but they determine the dayside and nightside temperatures, respectively, which 
are observable. Therefore, we present our results in terms of these temperatures, 
which we calculate by summing over all dayside surface elements in the model: 
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int, 1S the intrinsic flux, where A; is the area of surface element 
jand a is the Stefan—Boltzmann constant. As a consequence of the distorted shape 
of the donor, T;,;—the temperature of each surface element in the absence of 
irradiation—is not constant. Similarly, the total flux that heats each surface element 
on the dayside, Fro. = Fin + (1 — Ap) Fir, where Ag is the bolometric (Bond) 
albedo of the donor. F,,,; is the irradiating flux seen by this element: 
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in which d; is the distance of surface element j to the white dwarf, Twp is the tem- 
perature of the white dwarf, Rwp is the radius of the white dwarf and Ap,o,; is the 
projected surface intercepting the irradiating flux. Because we have good estimates 
of the temperature of the white dwarf and its binary parameters, F;,,j; is well con- 
strained observationally. Therefore, the absorbed irradiating flux and the dayside 
temperature can equivalently be thought of as measures of the albedo. We deter- 
mine optimal values of (Tay), (Thight) and Ag by fitting the observed phase- 
dependent water-band and broadband fluxes using a Markov chain Monte Carlo 
procedure. The results are shown in the Extended Data Fig. 1. We find 
(Tight) = 2344719 K and (Taay) = 2,40177 K; the errors represent the 1o confi- 
dence levels. The corresponding constraints on the albedo and the intrinsic donor 
luminosity are Ag < 0.54 (at 20 confidence) and Lin, = (3.10.1) x 10~4L 9 (at lo 
confidence), respectively. In all of these models, we adopt a white-dwarf temper- 
ature of Twp = 13,200 + 200 K on the basis of our own analysis of ultraviolet data 
obtained by GALEX, an inclination of i= 84.36° and a distance to the system of 
d= 226 pc, and adopt an orbital period of P,.) =78.106657 min on the basis of 
previous eclipse modelling estimates?. We also adopt a fixed gravity-darkening 
coefficient of 3=0.08 (ref. 27). 

Heat redistribution across the terminator. Our geometric reprocessing model 
assumes that none of the irradiating flux is redistributed from the dayside to the 
nightside. This model fits the data acceptably, suggesting that redistribution is 
inefficient in the atmosphere of the donor. To estimate this efficiency quantita- 
tively, we need to adopt a specific model for redistribution. Here, we assume that 
a constant fraction 0 < € < 1 of the irradiating flux that is absorbed at any point on 
the dayside of the donor is redistributed evenly across the entire surface. Similar 
models are commonly used in studies of irradiated exoplanets”*. In the context of 
this simple model, the difference between the fluxes emerging from the dayside 
and the nightside is: 


D piay Figr, ‘Aj 
ae 


in which (o( Tae)" —o( Tight) Je _o is the flux difference of the equivalent model 
without any redistribution. The modelling described in the previous section pro- 
vides us with a set of self-consistent flux differences for models with € =0. We can 
therefore transform these to the flux differences predicted for otherwise equivalent 
models with € > 0. Because our earlier modelling also provides us with estimates of 
the observed dayside and nightside temperatures, we can estimate € by comparing 
the observationally inferred flux difference to that predicted by our transformed 
models with € > 0. The results are shown in Extended Data Fig. 2. We find that 
the efficiency of redistribution is indeed low, 0.1 < € < 0.4, with higher values 
corresponding to lower albedos. The 20 upper limit on the redistribution efficiency 
(for Ap=0) is € < 0.54. 

Sample size. No statistical methods were used to predetermine sample size. 
Code availability. The code used to generate the model, as well as the atmosphere 
templates, ICARUS, is available at http://icarus.renebreton.org. 
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Extended Data Figure 1 | Posterior probability distributions for the irradiation model parameters. Colour scale contours show the joint probability 
for every combination of parameters. Contours represent the la, 20 and 3a levels. Marginal posterior distributions are shown as histograms with the 
median and lo marked as dashed lines. The Ag distribution is quoted as a truncated distribution with a 20 upper limit. 
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Extended Data Figure 2 | Redistribution efficiency limits for the irradiated substellar donor. Allowed family solutions of redistribution efficiencies ¢ 
as a function of Bond albedo Az are shown. The black and grey contours represent the lo and 20 confidence levels, respectively. 
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Self-assembly of coherently dynamic, auxetic, 
two-dimensional protein crystals 


Yuta Suzuki!, GiovanniCardone!', David Restrepo’, Pablo D. Zavattieri?, Timothy S. Baker!? & F. AkifTezcan! 


Two-dimensional (2D) crystalline materials possess unique 
structural, mechanical and electronic properties!” that make them 
highly attractive in many applications*°. Although there have been 
advances in preparing 2D materials that consist of one or a few 
atomic or molecular layers®’, bottom-up assembly of 2D crystalline 
materials remains a challenge and an active area of development®!°. 
More challenging is the design of dynamic 2D lattices that 
can undergo large-scale motions without loss of crystallinity. 
Dynamic behaviour in porous three-dimensional (3D) crystalline 
solids has been exploited for stimuli-responsive functions and 
adaptive behaviour'!"'’. As in such 3D materials, integrating 
flexibility and adaptiveness into crystalline 2D lattices would 
greatly broaden the functional scope of 2D materials. Here we 
report the self-assembly of unsupported, 2D protein lattices with 
precise spatial arrangements and patterns using a readily accessible 
design strategy. Three single- or double-point mutants of the 
C4-symmetric protein RhuA were designed to assemble via different 
modes of intermolecular interactions (single-disulfide, double- 
disulfide and metal-coordination) into crystalline 2D arrays. 
Owing to the flexibility of the single-disulfide interactions, the 
lattices of one of the variants (©®RhuA) are essentially defect-free 
and undergo substantial, but fully correlated, changes in molecular 
arrangement, yielding coherently dynamic 2D molecular lattices. 
©>8RhuA lattices display a Poisson’s ratio of —1—the lowest 
thermodynamically possible value for an isotropic material— 
making them auxetic. 

Proteins are attractive building blocks for 2D materials because 
of their structural and chemical diversity, and inherent functions. 
Examples for natural, protein-based 2D materials include bacterial 
S-layer proteins and purple-membrane assemblies, which form crys- 
talline arrays in association with cell walls and membranes, respec- 
tively, and have been used in diverse technological applications!*". 
On the synthetic front, methods for 2D protein crystallization have 
been developed for the structural characterization of membrane 
proteins!®'” or functional applications'’, and generally rely on lipid 
layers as supports. Recently, 2D or 3D supramolecular protein arrays 
have been designed through the symmetric polymerization of protein 
building blocks via computationally designed protein-protein interac- 
tions or fusion of protein components!?-*”. However, these approaches 
are engineering-intensive and highly dependent on the accuracy of 
the design, and the integration of dynamic or adaptive behaviour has 
yet to be explored. 

To address these issues, we used a simple chemical-bonding strat- 
egy to control protein self-assembly. We reasoned that both cysteine 
(Cys)-mediated disulfide bonds and metal-coordination interac- 
tions between protein building blocks could produce crystalline 
and dynamic arrays with minimal design, because these bonds are: 
(1) strong but reversible (to minimize the surface area to be designed 
and ensure self-healing); (2) short, yet sufficiently flexible (to simulta- 
neously afford crystallinity and adaptiveness); (3) chemically tunable 


(to exert external control over self-assembly and enable stimuli- 
responsiveness); and (4) easily designed and engineered. 

The most straightforward route to obtaining 2D lattices is the 
tessellation of C3-, C4- or Ce-symmetric building blocks through 
appropriately positioned C,-symmetric linkages such as disulfide 
bonds or many metal-coordination interactions. As a model build- 
ing block, we chose t-rhamnulose-1-phosphate aldolase (RhuA), a 
C4-symmetric homotetramer (dimensions, 7nm x 7nm x 5nm) that was 
previously used as a synthon for supramolecular assemblies”? (Fig. 1a). 
An inspection of RhuA indicated protrusions in the four corner 
positions as ideal locations to incorporate one Cys or two histidine 
(His) residues for disulfide- or metal-directed self-assembly, respec- 
tively. We generated two variants, ©’°RhuA and #°?/"°8RhuA, with 
four conditionally self-associating corners for forming square lattices 
(Fig. 1a, b, top and middle rows). The relative positions of residues 
63 and 98 were deemed to be conducive to forming a bis-His metal- 
coordination motif to afford a tetrahedral or square planar coordi- 
nation geometry for metal-mediated RhuA pairing interactions. 
Additionally, we noted that RhuA could be converted into a stable 
D4-symmetric octamer via a single point mutation (A88F)**. Thus, 
we prepared a third variant, ***/©°*8RhuA, which presents eight 
symmetry-related cysteines at roughly 45° angles in a 2D projection 
(Fig. 1a, bottom row). We envisioned that the Cy-symmetric ©’°RhuA 
and 163/H°8RhuA variants could yield square lattices with two distinct 
patterns in terms of the orientation of the building blocks with respect 
to the 2D plane (Fig. 1b). On the other hand, the D, symmetry of 
F88/C8RhuA would dictate a 2D lattice with equivalent faces (Fig. 1b). 
These RhuA building blocks would also provide three distinct modes 
of inter-building-block interactions, whose effects on self-assembly 
we hoped to investigate: single-disulfide, double-disulfide and bis- 
His-anchored metal-coordination. 

For the oxidative self-assembly of ©’RhuA and F8°/O8RhuA, we 
tested various strategies including air oxidation, redox buffer systems 
containing reduced and oxidized glutathione (GSH and GSSG) or 
low concentrations (<10mM) of the reductant 3-mercaptoethanol 
(83ME), which slowly decomposes in aqueous solution and results in 
gradually more oxidizing conditions. In these experiments, solutions 
of purified ©8RhuA and F8°/“*8RhuA were first rapidly exchanged via 
repeated centrifugal filtration into solutions that varied in terms of 
their pH (6-8.5), buffering species (sodium phosphate, CHES, MES, 
MOPS, Tris; at 5-20 mM), the compositions of the redox buffer- 
ing system (1:19 or 19:1 GSH:GSSG; at 1 mM total concentration) 
or the concentrations of 3ME (0-10 mM). After adjustment to the 
desired final protein concentration (25-175 1M), these solutions were 
monitored for the formation of self-assembled structures by visual 
inspection for emergence of cloudiness and by transmission electron 
microscopy (TEM). Likewise, for metal-directed assembly, purified 
H63/H°8RhuA was exchanged into solutions with varying pH and buffer 
ions (to modulate metal binding kinetics and thermodynamics), with 
the exception that these solutions did not contain any reductants. 
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Figure 1 | RhuA constructs and their disulfide- and metal-mediated 
self-assembly modes. a, Schematic representations of the ©’°RhuA, 
Ho3/H°8RhuA and £8/C°8RhuA structures. The top and bottom halves 
of RhuA are coloured orange and blue, respectively, to highlight the 


Self-assembly was initiated by addition of 4-40 molar equivalents 
of Zn?* and Cu** (1-10 equiv. per bis-His motif), both of which 
can accommodate the desired four-coordinate geometries to link 
H63/H98RhuA building blocks at their corners. 

We found that the assembly of all variants into 2D arrays was robust 
and occurred under a wide range of conditions. As expected, crystal- 
line self-assembly was favoured by conditions that promoted slow, 
controlled oxidation or slow metal-binding kinetics. For example, 
uncontrolled air oxidation of ©’RhuA solutions or addition of large 
molar excess of Zn** and Cu?* to #°/°8RhuA samples resulted largely 
in amorphous aggregates in addition to some crystalline domains 
(Extended Data Fig. 1). The following solution conditions yielded 2D 
assemblies that were optimized in terms of size, crystallinity and yield: 
WeRhuA and"? RhwA (> 125 uM protein, 10 mM BME, pH7.5, 
10mM Tris); 2°" 8RhuA (25 uM protein, 200 uM ZnCl, pH7, 20mM 
MOPS). Under these conditions, “’RhuA reproducibly assembled 
into straight-edged, single- or few-layered 2D crystals that grew to 
several micrometres in size over several days (Fig. 2a). Negative-stain 
TEM, cryo-TEM, scanning electron microscopy (SEM) and atomic 
force microscopy (AFM) measurements revealed that the °®RhuA 
crystals were highly ordered and possessed uniform square or rec- 
tangular shapes with molecularly sharp boundaries (Fig. 2a, Extended 
Data Figs 2 and 3); these features have not been previously observed 
in designed 2D protein crystals!®”'. 2D crystals of #°/"°8RhuA 
(Fig. 2b, Extended Data Fig. 4) and '8*/8RhuA (Fig. 2c, Extended 
Data Fig. 5) were several hundred nanometres to one micrometre in 
size, but typically displayed irregular morphologies. #°/"?*RhuA lat- 
tices were monocrystalline, but tended to grow in three dimensions 
over time, whereas F8°/“*8RhuA crystals consisted of polycrystalline 
domains. 

As monitored by TEM and dynamic light scattering (DLS), self- 
assembly of all variants was reversible upon addition of high 
concentrations of either BME (>10mM) in the case of “°RhuA and 
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relative molecular orientations in b. Cys and His residues inserted into 
positions 98 and 63 are shown in red and cyan, respectively. b, Expected 
2D molecular arrangements of “8RhuA, #°/#°8RhuA and F8/CRhuA 
lattices. M?* refers to Zn”* or Cu’ ions. 


F88/C98RhuA crystals or ethylene-diaminetetraacetic acid (EDTA; 
>10mM) in the case of "°"#?8RhuA (Extended Data Fig. 6). The 
variant P*RhuA, which bears the native D98 residue and is devoid 
of corner Cys residues or the bis-His motifs, was not competent to 
self-assemble upon oxidation or metal addition (Extended Data 
Fig. 7a). These observations confirm the proposed modes of disulfide- or 
metal-mediated self-assembly. Two additional variants (©!37-RhuA 
and ©76°RhuA) were prepared with surface cysteines located on the 
sides of the tetramers (Extended Data Figs 7b, c). Theoretically, these 
variants could be envisioned to self-associate into close-packed 2D 
square lattices through disulfide linkages. Instead, they both formed 
non-crystalline aggregates upon oxidation, which we ascribe to the 
steric crowding around the C133 and C266 positions that precludes 
the orientation of RhuA molecules for planar self-assembly. These 
experiments suggest that the placement of Cys residues at steri- 
cally isolated, corner locations is preferable for the formation of 2D 
crystalline arrays. 

Exhaustive TEM analyses involving many crystals of each variant 
indicated p42)2 plane group symmetry for *RhuA and f8°/©8RhuA 
and p4 symmetry for #°/"°8RhuA crystals (Fig. 2, Extended Data 
Fig. 8a, b and Supplementary Table 4). These assignments were cor- 
roborated by electron diffraction data for ’RhuA and #/"8RhuA 
crystals (Extended Data Fig. 8c). Experimental 2D electron micros- 
copy projection maps agree very well with those calculated from 
structural models (Extended Data Fig. 9a) and confirm the expected 
protein-protein interaction geometries: corner-attached ©8RhuA 
and #°/H°8RhuA and side-attached °°/“*8RhuA (Fig. 2, columns 
(iv) and (v)). The plane group symmetries suggest that ©°RhuA 
tetramers adopt a checkerboard pattern (p42,2 symmetry), whereas 
H63/H98RhuA tetramers self-assemble in the same orientation with 
respect to the 2D plane (p4 symmetry). Together with the fact that 
F88/C98RhuA possesses two-fold symmetry parallel to the 2D plane, 
we thus obtain three distinct 2D molecular patterns with minimal 
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Figure 2 | TEM characterization of the 2D crystals of RhuA variants. 
a, “®RhuA; b, #%/H8RhuA (Zn*+ coordination); and c, *88/“8RhuA. (i) 
Low-magnification views, (ii) high-magnification views, (iii) Fourier 
transforms of column (ii), (iv) reconstructed 2D images, and (v) structural 
models based on the 2D reconstructions in column (iv) (colouring as 


alteration of a single protein building block (Extended Data Fig. 9b). 
©8RhuA and #°/H°8RhuA molecules self-assemble in distinct orien- 
tations in the 2D plane, suggesting that the energetic bias stemming 
from disulfide or metal bonding configurations and surrounding 
protein-protein interactions must be sufficiently large to favour one 
orientation over others to yield long-range order. 

Superior quality and larger sizes of *RhuA crystals compared to 
F88/C8RhuA and #°/H°8RhuA lattices are readily explained by interac- 
tions that direct the self-assembly of the variants. *8°/“?*RhuA presents 
a roughly circular distribution of eight cysteines (Fig. 1a, bottom row). 
This arrangement renders the desired double-disulfide-mediated, 
side-to-side self-assembly mode non-unique, and permits alternate 
attachment geometries between f8*/“?8RhuA molecules (Extended 
Data Fig. 5). Moreover, the increased Cys-valency of *8°“8RhuA leads 
to considerably stronger interactions and therefore ‘stiffer’ lattices: 
the dissolution of ¥°°/©?8RhuA polycrystallites requires substantially 
higher amounts of BME and longer incubation periods compared to 
“8RhuA lattices (Extended Data Fig. 6). Consequently, *°°/©"RhuA 
lattices display vacancies, as well as both high- (>30°) and low-angle 
(<10°) grain boundaries (Extended Data Fig. 5). #/"*8RhuA crystals 
possess fewer such defects, owing to the reversibility of Zn?* coordina- 
tion interactions. Yet, each #°?’2°8RhuA building block also contains 
numerous surface residues that can weakly coordinate Zn7* ions, pro- 
moting crystal growth in the third dimension (Extended Data Fig. 4a). 

In contrast to *88/“?8RhuA and #°/"°8RhuA, self-assembly of 
©*8RhuA is both chemically and orientationally specific. Moreover, 
the reversibility and inherent flexibility of single Cys-Cys linkages— 
containing five rotatable bonds—probably allows for the correction 
of any defects such as vacancies and grain boundaries, which would 
be difficult to accomplish in a rigid lattice composed of strongly inter- 
acting building blocks such as *8*/©"8RhuA. Indeed, as TEM images of 
mono-layered “*8RhuA crystals illustrate, the outcomes are (a) mac- 
roscopic crystal morphologies that reflect the molecular symmetry 
of the building blocks, (b) molecularly sharp crystal boundaries, and 
(c) lattices with extremely low defect frequencies. In the hundreds of 
monocrystalline “’RhuA lattices with surface areas of >1 1m? that we 
examined closely, we rarely found a lattice defect, despite the fact that 
these crystals grow in 3D space in an unsupported fashion. Even ina 
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in Fig. 1). The high-resolution limits in column (iii) are approximately 
14A. Unit cells are shown as black squares in column (v) (with corners 
placed at four-fold symmetry axes); unit cell constants (a = 3, y): P®RhuA 
(114A, 90°), #°°/°8RhuA (91 A, 90°) and F8/©8RhuA (115 A, 90°). 


rare instance such as that shown in Extended Data Fig. 9d, the defect 
frequency was one missing ©’°RhuA molecule within a lattice grid of 
about 9,000 molecules (about 0.6 m7). 

A more striking consequence of the disulfide bond flexibility is the 
coherent dynamic behaviour of “**RhuA crystals. Although our initial 
sample preparations of ©*’RhuA crystals yielded predominantly open 
lattices with large pores (Fig. 2a), we noticed that these crystal suspen- 
sions developed a dense sediment over a period of 1-3 days at 4°C 
(Extended Data Fig. 10a). TEM analysis of these sedimented crystals 
indicated a close-packed lattice arrangement (Fig. 3a, b, right panels). 
Upon resuspension of the sedimented crystals by repeated gentle 
mixing with a pipette and subsequent TEM imaging of the resulting 
samples, we captured a total of at least seven types of 2D ©’°RhuA 
crystals in distinct conformational states (I-VID) (Fig. 3a, Extended 
Data Fig. 10b). These conformational states were categorized by com- 
putational image analysis according to the roundness indices of the 
lattice pores, ranging from >0.85 for state I to <0.35 for state VII 
(see Methods). As evidenced by the retention of p42;2 symmetry and 
nearly equal unit-cell dimensions, these seven conformational states 
are clearly interconnected and implicate a continuous lattice motion 
between fully open and fully closed states (Supplementary Video 1). 
These large amplitude motions of the “’RhuA lattices are afforded by 
a remarkable extent of hinging about the flexible disulfide linkages and 
their placement at corner locations of RhuA molecules. The transition 
from the open to the closed state is accompanied by the compression 
of the inter-°RhuA hinge angle (a) from >80° to 17°, a decrease of 
the pore size from approximately 4.4nm to 1.0 nm (for the passage of 
a spherical object) and an increase in the relative protein/hole surface 
density of 170% (Fig. 3b, Table 1). 

Conformational dynamics of ©’*RhuA crystals are fully coherent: in 
each crystal examined, only one type of conformational state (I-VI) 
was observed throughout the lattice (Fig. 3b). This observation implies 
that any mechanical deformation of a ©’*RhuA crystal is cooperatively 
propagated along the 2D plane, enabled by both the flexibility and 
short linker-length of single-disulfide linkages: longer, more flexi- 
ble linkages would preclude coherent dynamics, whereas inflexible 
linkages would lead to non-adaptive lattices. Indeed, ***/“*8RhuA and 
H63/H°8RhuA lattices do not display any apparent dynamic behaviour 
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Figure 3 | Dynamic and auxetic nature of 2D ©®RhuA crystals. 

a, Reconstructed 2D images of seven distinct conformational states (I- VII) 

of 2D ©$RhuA crystals. b, High-magnification views and derived structural 
models of conformations II, V and VII. Unit cells and hinge angles (a) 
between ©*’RhuA units are highlighted in black and red, respectively (orange 
and blue colouring as in Fig. 1). c, Population distributions of °’RhuA crystals 
in different conformational states during repeated resuspension/sedimentation 
cycles. n refers to the total number of individual lattices analysed in each panel. 
d, Schematic representation of the rotating, rigid-square model that describes 
the 2D ©$RhuA lattice dynamics (colouring as in Fig. 1; the building blocks 
are numbered for clarity). Ax and Ay denote changes in the transverse- and 
longitudinal-dimension lengths upon lattice opening and closing. e, Digital 
image-correlation analysis of reconstructed TEM images of ©’RhuA crystals 
for determining auxetic behaviour. Representative volume elements (RVEs) in 
the lattices of the two extreme states (I and VII) are indicated with red squares, 
with the vertices of the RVEs numbered 1-4, and the vectors M, N (in state I) 
and m, n (in state VII) used to calculate local engineering strains are shown 
with cyan arrows. The edges of the lattice pores, used for determining the 
positions of the RVE vertices, are shown as purple lines. f, Calculated Poisson's 
ratios (v) of lattice conformational states with respect to state I. The error 

bars correspond to the uncertainties in pixel selection during digital image 
processing (see Methods for details). 
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Table 1 | Structural parameters of °9°RhuA crystals in states II, V 


and VII 

State Il State V State VII 
Unit cell (a=b,7¥) 114.4A,90° 110A,90° 107.8A,90° 
Plane group symmetry p42,2 p42,2 p42,2 
Intertetramer hinge angle, a 79° 49° 17 
Protein surface area per unit 82.7 nm? 86.2nm? 88.0nm? 
cell, Aprotein 
Pore surface area per unit 44.7 nm? 34.7 nm? 28.2 nm? 
cell, Apore 
Relative protein/pore density, 1.9 Zo od 
Aprotein/Apore 


because their double-disulfide- or metal-mediated modes of assem- 
bly do not allow rotation of the neighbouring protein molecules with 
respect to one another. 

To establish that the conformational dynamics of “*RhuA crystals are 
reversible, we subjected them to repeated sedimentation/resuspension 
cycles within the solutions in which they self-assembled at 4°C. In 
each cycle, we obtained TEM images of >100 individual crystals from 
the same container, collected from either the sediments that formed 
overnight or the suspensions immediately after mixing the sediments 
by repeated pipetting. As shown in Fig. 3c, the conversion between 
the closed states (VI and VII) and open or intermediate states (I-V) 
is completely reversible and absolute. Sedimented samples do not 
contain any open or intermediate states and resuspended samples do 
not contain any closed states, providing unambiguous evidence that 
“8RhuA lattices are reversibly dynamic. On the basis of the observed 
distribution profiles, the opening of the lattices by mechanical agita- 
tion and their distribution among conformational states I-V appear 
to be immediate (at least within the ~5-min timescale of TEM sam- 
ple preparation), whereas their full closure takes several hours. We 
posit that the energetic barriers between different “®RhuA lattice 
conformations (due to different disulfide bond configurations and 
long-range protein-protein interactions) must be small enough to be 
overcome by mechanical agitation and internal protein dynamics. The 
fully closed conformation (state VII) appears to be a kinetically sta- 
ble conformation that accumulates over time in unagitated solutions 
at 4°C. This kinetic stability may be ascribed to the dense protein 
packing interactions in state VII and the resulting restriction of the 
dynamics of both the lattice and the individual building blocks. 

Geometric considerations based on the retention of p42;2 symme- 
try (Fig. 3b, d) as well as a strain analysis of the seven conformational 
states by digital image correlation (Fig. 3e, Extended Data Fig. 10c) 
indicate that ©’’RhuA crystals are auxetic and have a Poisson’s ratio 
of v= —1.00 £0.01 (Fig. 3f). Poisson's ratio is a scale-independent 
metric that describes the response of a material to strain; it is defined 
as the ratio between transverse (e,) and longitudinal (e,) strains under 
uniaxial loading”. The strains e, and ey are, in turn, approximated by 
changes in material length in the transverse (Ax) and longitudinal 
(Ay) directions (Fig. 3d)**: 


as 
—— Ax 


where —1 <v<0.5 for an isotropic 3D material and —1<v<1 for 
an isotropic 2D material*®. Most materials have positive v; that is, 
they become thinner in the longitudinal direction when stretched 
transversely’. By contrast, materials with negative v (that is, 
auxetic materials) display the counterintuitive behaviour of longitu- 
dinal expansion upon transverse stretching. Thus, auxetic materials 
can be expected to possess enhanced toughness, resistance to inden- 
tation and shear stiffness as well as favourable damping and acous- 
tic response, and have been proposed for use in protective armours, 
smart textiles, actuated filtration, and piezoelectric and biomedical 
devices*?’”-*?, Although materials with negative v exist, most fall in 
the range of —0.4 << 0, with the lowest reported values of —0.7 
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to —0.8 observed in re-entrant foams*”*!. It has been postulated”® 
that for a 2D lattice of rotating rigid squares with flexible hinges, 
v should be equal to the lowest thermodynamically permissible value 
of —1 at all rotation angles. “*’RhuA lattices represent a true reali- 
zation of this theoretical model, and the first isotropic material with 
v = —1 designed and constructed at the molecular scale. Assuming 
the model of ref. 26, we calculate that “°*RhuA crystals should shrink 
or expand simultaneously in the x and y dimensions by at least 24% 
during the conversion between fully open (J) and fully closed (VII) 
states (Extended Data Fig. 10d). 

We envision that upon hierarchical assembly via physical methods 
or incorporation into polymeric materials using chemical strategies, 
molecular architectures such as 2D ©*8RhuA lattices may be used as 
feedstocks for adaptive and auxetic macroscopic materials. In gen- 
eral, our study underscores the utility of dynamic covalent bonds in 
the construction of highly ordered, yet adaptive, protein materials. 
Specifically, owing to their high structural quality and chemically 
tunable assembly under ambient conditions, “’RhuA crystals pro- 
vide a unique medium for studying molecular self-assembly and crys- 
tallization as well as for investigating the energy landscape of lattice 
dynamics. The resultant understanding of structural dynamics at the 
nanoscale should greatly aid the fabrication of functional materials. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Design of RhuA variants and site-directed mutagenesis. RhuA variants were 
designed as described in the main text on the basis of previously reported crystal 
structures (Protein Data Bank (PDB) 1GT7”? for “*®RhuA, #/#*8RhuA, P*?RhuA, 
©133RhuA and ©°°RhuA, and PDB 2UYU” for *°8/C8RhuA. All RhuA variants 
also contain the mutations E192A and C126S as previously reported”>4. The 
gene for ©’*RhuA, pre-inserted into the pJ414 expression vector optimized for 
expression in E. coli, was purchased from DNA2.0. *8°/°8RhuA, #/"°8RhuA, 
D°8RhuA, ©3RhuA and ©7RhuA were prepared using QuikChange mutagen- 
esis (Stratagene) with primers obtained from Integrated DNA Technologies 
(Supplementary Table 1). The mutant plasmids were transformed into XL-1 Blue 
E. coli cells followed by purification using the QIAprep Spin Miniprep kit (Qiagen). 
The presence of the mutations was verified by sequencing (Retrogen). Amino acid 
sequences of RhuA variants are shown in Supplementary Table 2. 

Protein expression and purification. Bacterial expression of RhuA variants 
was performed according to previously published procedures with slight mod- 
ifications*”. Plasmids bearing the variants were transformed into BL21(DE3) 
E. coli cells, and the colonies were grown overnight at 37 °C on lysogeny broth 
(LB) agar plates (pH 7.4) containing 100 mg]! ampicillin. Starter cultures (5 ml 
with 100mg1~! ampicillin) from single colonies were grown for about 4-6h at 
37°C (with shaking at 250 r.p.m.) before inoculation into 1-l LB cultures con- 
taining 100 mg1~' ampicillin. After the cells were grown to an optical density of 
about 0.8 at 600 nm, protein expression was induced with 1 mM isopropyl 3-D-1- 
thiogalactopyranoside (IPTG, Gold Biotechnology) for 12-13 h (at 37°C with 
shaking at 250 r.p.m.). Cells were pelleted by centrifugation (5,000 rp.m. at 4°C 
for 10 min), and resuspended in a buffer solution containing 10 mM tris(hydrox- 
ymethyl)aminomethane hydrochloride (Tris; pH 7.5), 1mM ZnCl, and 10 mM 
B-mercaptoethanol (8ME). For #°?/"*8RhuA preparations, ZnCl, was excluded 
from buffer solutions to avoid possible protein precipitation during purification. 
Cell lysis was performed by sonication for 15 min on ice. The lysis solution was 
centrifuged for 30 min at 12,000r.p.m. at 4°C. Polymin-P (Acros) was added to the 
supernatant at a final concentration of 0.15% (w/v) for nucleic acid precipitation 
and the resulting mixture was stirred for 30 min before centrifugation for 30 min at 
12,000 r.p.m. at 4°C. The supernatant was loaded onto a DEAE-Sepharose CL-6B 
(GE Healthcare) column and eluted using a gradient of 0-500mM NaCl in Tris 
buffer at 4°C. Fractions containing RhuA, which eluted at approximately 200 mM 
NaCl, were added to 1.7 M ammonium sulfate, and were centrifuged for 45 min at 
12,000 r.p.m. at 4°C. The precipitate was dissolved in a buffer solution containing 
5mM sodium phosphate (NaPi; pH 7.2), 1mM ZnCl, and 10 mM BME, and then 
dialysed three times against 51] of the same buffer solution. Further purification was 
done using either a High Q cartridge column (BioRad; at pH 8.0) using a 0-500 mM 
NaCl gradient or a Mini CHT Type I hydroxyapatite column (BioRad; at pH 7.2) 
using a 5-500 mM NaPi gradient on a DuoFlow fast protein chromatography work- 
station (BioRad). Pure protein fractions eluted at 350 mM NaCl and 200 mM NafPi, 
respectively. These fractions were combined, dialysed three times against a buffer 
solution of 10mM Tris (pH 7.5), 1mM ZnCl, and 10mM BME, and concentrated 
in an Amicon stirred cell at 4°C. The protein was then flash frozen and kept at 
—80°C until use. Protein purity was confirmed by SDS PAGE (Supplementary 
Fig. 1) and ESI-MS (Supplementary Fig. 2 and Supplementary Table 3), and stock 
concentrations were determined as previously reported’. 

Preparation of 2D RhuA crystals. All 2D RhuA crystals reported here were 
obtained in an unsupported fashion in solutions. As described in the main text, 
several solution conditions were screened for optimizing the formation of 2D pro- 
tein crystals of ©’’RhuA, *°/8RhuA and #°/4°8RhuA variants. For all variants, 
these screening conditions included various pH values (6-8), starting protein 
concentrations (25-175 1M), buffer salt concentrations and identities (5-20 mM 
bis-Tris, CHES, MES, MOPS, NaPi and Tris) and temperatures (4-37 °C). For the 
disulfide-mediated self-assembly of @’RhuA and '°*/8RhuA, we additionally 
screened various oxidizing conditions (1-2 mM total GSH/GSSG at ratios var- 
ying from 1:19 to 19:1, or 5-10 mM ME, which slowly decomposes in aqueous 
solution). For the metal-mediated assembly of "/"°8RhuA, we screened 4-20 
molar equivalents (over tetrameric protein concentration) of ZnClz and CuCl. 
As stated in the main text, the best conditions were: ®RhuA and F8°/°8RhuA 
(>125.M protein, 10mM BME, pH7.5, 10 mM Tris); #°/"°8RhuA (25 uM 
protein, 200 1M ZnCl, pH 7, 20mM MOPS); all at 4°C. In general, the forma- 
tion of crystals was either immediate (upon metal addition for #©/"°8RhuA) or 
lasted overnight to several days (for ©°8RhuA and '°8/?8RhuA). Crystal forma- 
tion resulted in increasing cloudiness of the self-assembly solutions. The growth 
of ©°8RhuA crystals could be accelerated by gentle shaking (Extended Data 
Fig. 2a, b). In additional experiments, we tested the thermo- and chemo-stability 
of °$RhuA crystals, as shown in Supplementary Fig. 3. 

Electron microscopy. For negative-stain TEM sample preparation, 3-3.5-l 
aliquots of crystal suspensions were applied onto negatively glow-discharged 


carbon-coated Cu grids (Ted Pella, Inc.), washed with Milli-Q water, and stained 
with 1% uranyl acetate at 4°C. For cryo-EM sample preparation, 3-3.5-l 
aliquots of 251M ©8RhuA sample (diluted sample) were deposited onto negatively 
glow-discharged Quantifoil grids, and then plunged into liquid ethane after blot- 
ting. The sample was then stored under liquid nitrogen until analysis. Sample 
screening was performed in an FEI Sphera transmission electron microscope 
equipped with a LaBg electron gun at 200 keV and imaged on Gatan 2K? CCD. 
A complete electron crystallographic analysis was done on 43, 50 and 33 micrographs 
for ©$RhuA, #/"°8RhuA and '8/C8RhuA, respectively, which were selected by 
visual assessment. Micrographs were each processed separately using the 2dx soft- 
ware>’, which implements a semi-automatic processing pipeline that is mostly 
based on programs from the MRC suite*‘. The program allows one to determine 
the 2D plane group lattice symmetry of each crystal, to calculate its unit-cell dimen- 
sions and to generate a projection density map. The processing involves estimation 
of the contrast transfer function (CTF) using CTFFIND3*, spot-list determination, 
automatic lattice determination, crystal masking, unbending and generation of a 
projection map after CTF correction. Within this program, the plane group sym- 
metry is determined using ALLSPACE”* by comparing the internal phase residual 
for each symmetry to its expected value. Lattice unit-cell dimensions were deter- 
mined on a subset of images, screened by the quality of the reflections—12, 
44 and 13 for ©*RhuA, #/"*8RhuA and f8/8RhuA, respectively. The estimated 
resolution limit of each image (15-30 A) was assessed by both visual inspection of 
its computed Fourier transform and comparison with simulated images filtered at 
different levels of resolution. 

AFM measurements. For AFM sample preparation, 2 1l of crystal suspensions 
were applied on freshly cleaved mica, washed with deionized water and dried. 
AFM images were collected using a Dimension Icon microscope (Bruker) using 
Scan Asyst peak force tapping (in air) mode at a resolution of 512 lines per image 
and a scan rate of 0.3-0.5 Hz. The AFM images obtained were processed using 
Nanoscope Analysis (Bruker). 

SEM measurements. For SEM sample preparation, 2 1] of crystal suspensions were 
applied onto SiO, substrates. The sample was sputter-coated with iridium oxide 
before analysis to reduce charging effects. SEM micrographs were collected on an 
FEIXL 30UHR scanning electron microscope operated at 5 keV. 

DLS measurements. DLS experiments were performed using a Wyatt DynaPro 
NanoStar instrument. The experiment runs were performed to collect 10 acqui- 
sitions with 657-nm excitation at a power setting of 100%. Measurements were 
plotted using a Rayleigh sphere model. Peak radius cut-offs were fixed according 
to default settings (0.5-10,000 nm). 

Structural modelling and simulations. Projected electron-density maps were 
used as a reference in the visualization software UCSF Chimera*’ to determine 
the orientation of the RhuA molecules in each crystal. The atomic coordinates 
were extracted from PDB entries 1GT7 (C,-symmetric tetramer for ’RhuA and 
Ho3/H98RhuA) and 2UYU (D4-symmetric octamer for F°°/©°8RhuA) and placed 
manually to fit the position of one subunit in the observed projected density map. 
The orientation of the molecule was refined in an iterative manner by comparing 
the experimental density with the density simulated from a model of the unit cell, 
obtained as explained below. Simulated projected electron-density maps were com- 
puted using Bsoft (http://Isbr.niams.nih.gov/bsoft)** and EMAN2 (http://blake. 
bcm.edu/emanwiki/EMAN2)°*”. Starting from the estimated position and orien- 
tation of a single subunit, multiple copies of the model were generated to cover 
one unit cell (bmoledit in Bsoft) and converted to electron density (e2pdb2mrc.py 
in EMAN2) with a resolution limit of 30 A. Finally, the density volume was 
projected along the vertical direction (bproject in Bsoft) to generate a 2D density 
map comparable to the experimental map. 

Classification of the conformational states of 2D ©®RhuA crystals. TEM micro- 
graphs of “’’RhuA crystals were analysed computationally to categorize different 
conformational states of the 2D lattice. Essentially, for each raw image we deter- 
mined a ‘roundness index’ number derived from the shape of the pores in the 
lattice, and used this value to classify the images. The analysis was performed 
on 982 images using an ad hoc algorithm coded as a macro in Fiji (http://fiji.sc/ 
Fiji)” and ImageJ (http://imagej.nih.gov/ij)*'. Each image was initially processed 
by applying a 3 x 3 mean filter three times, and then converted to a binary image 
after automatic thresholding to segment out the pores. The mask was then filtered 
using the opening morphological operator to smooth the shape of the pores and 
to remove small outliers. Pores that were distorted from potential bending of the 
crystal in some regions of the field of view were discarded using their size for 
screening. For images acquired at nominal magnifications of x 50,000 and 68,000 
(calibrated pixel size of 2.033 A and 1.646 A, respectively), only pores with areas 
that covered 500-1,200 pixels were kept. The remaining pores were modelled as 
ellipses and their roundness index was determined as the ratio between the lengths 
of the major and the minor elliptical axes. A single index value was assigned to each 
micrograph as the average of all the segmented pores. The images were assigned 
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to one of the seven identified states using the following criterion: state I, >0.85; 
state II, 0.75—0.85; state IIT, 0.65-0.75; state IV, 0.55-0.65; state V, 0.45—0.55; state 
VI, 0.35-0.45; state VII, <0.35. 

Generation of video simulating the lattice motions. Supplementary Video 1 was 
generated to illustrate the dynamics of 2D ©$RhuA crystals starting from the TEM 
snapshots of the seven conformational states (I-VII; Fig. 3a). All projected density 
images were rescaled to include the same field of view (bint and bimg from Bsoft). 
Twenty-four intermediate image frames were created between each pair in the 
sequence by pseudo-morphing, using the program convert from the software suite 
ImageMagick. Finally, the video was generated with ffmpeg (http://www.ffmpeg. 
org) by combining the images as frames. 

Digital image correlation for determining Poisson’s value. Using Matlab, we per- 
formed digital image processing of the reconstructed 2D TEM images of dynamic 
©8RhuA crystals for evaluating Poisson's ratios. As a first step, histogram normal- 
ization was performed, taking the image of conformational state I as a reference to 
normalize the intensity of the images (Extended Data Fig. 10c). This normalization 
guarantees that the thresholds used for pixel selection have the same meaning for 
all the images. After normalizing, we proceeded to select the representative volume 
element (RVE) in each of the images. The RVE considered has a rectangular section 
whose vertices correspond to the centroid of the lattice pores (shown as green rec- 
tangles in Extended Data Fig. 10c and red rectangles in Fig. 3e). To find the position 
of the centroids, we determined the edges of the pores using the Sobel edge-detection 
method. The edges of the pores are shown as purple lines in Extended Data Fig. 10c 
and Fig. 3e. Then, centroid calculation was performed by taking the mean of the 
coordinates in x and y for each pixel in the border. After the selection of the RVE, 
the size of the RhuA building blocks was measured on each image by defining the 
circle that is circumscribed to each square (shown as blue circles in Extended Data 
Fig. 10c). Because the square shape of the RhuA building blocks can be assumed 
to remain rigid in each conformational state, we determined that the images were 
at slightly different scales; thus, appropriate magnification factors were calculated 
taking the image corresponding to conformational state I as a reference. The applied 
scaling factors are shown in Extended Data Fig. 10c. We then calculated the posi- 
tion of the vertices of the RVE on each conformational state and proceeded with 
the calculation of Poisson's ratio. For each RVE, local values of engineering strains 
e, and ey are calculated from the vectors M, Nand m, n that span the RVE on its refer- 
ence (state I) and deformed configurations (states II-VII), respectively (see Fig. 3e): 


Nx 


n= 
> 
Ny 


The deformed and undeformed configurations are related through the deformation 
gradient F: 
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where: 
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Homogenized values of the engineering strains for the RVE are calculated as: 
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Finally, Poisson’s ratio is calculated as: 
e 
eS 
ex 


To determine the uncertainties in v (error bars in Fig. 3f), we calculated the 
local standard deviation given the distribution of pixel intensity of each Fourier 
transform image (using the command stdfilt in Matlab). From this local standard 
deviation, we obtained the uncertainty in the intensity of each pixel that forms the 
edges of the pores. These uncertainties were propagated in calculating the positions 
of RVE vertices and finally in calculating the Poissons ratios. 
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Extended Data Figure 1 | TEM characterization of ©®RhuA and #%/"°8RhuA self-assembly under non-optimal conditions. a, 251M ©*8RhuA was 


incubated at 4°C under air exposure for 1 day; the solution contained 20 mM Tris (pH 8) and no reductants or oxidants. b, 25 1M #°?/"°8RhuA was 
incubated in the presence of 1 mM ZnCl, at 4°C for 1 day; the solution contained 20 mM MOPS (pH7). 
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Extended Data Figure 2 | TEM characterization of optimized 2D ©*RhuA crystals. a, 125\.M ©*RhuA was incubated in the presence of 10 mM BME 


at 4°C in a standing solution for 3 days. b, 125 1M ©$RhuA was incubated in the presence of 10 mM BME at 4°C with gentle shaking for 3 days, followed 
by 2 days at rest, during which a dense precipitate of crystals formed. 
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Extended Data Figure 3 | Additional structural characterization of 2D ©*RhuA crystals. a, AFM; b, SEM; c, Cryo-TEM. The middle column in a 
shows the profiles along the arrows in the left column. 
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Extended Data Figure 4 | TEM characterization of 2D #/"*8RhuA crystals. a, b, 254M #°/"°8RhuA was incubated with 200 1.M ZnCl (a) or CuCl, (b) 
ina20mM MOPS buffer solution (pH 7.0) at 4°C for 1 day. 
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Extended Data Figure 5 | TEM characterization of 2D *8*/©8RhuA crystals. a, 125 1M *8°/©°8RhuA was incubated in the presence of 10 mM ME at 
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4°C with gentle shaking for 2 days. b, Schematic representations of various types of defects observed in F8°/“°8RhuA crystals, corresponding to boxed 
areas (I-IV) in a (colouring as in Fig. 1). 
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after addition of ME. Right panels, 125 1M ©8RhuA (a) or 8°/°8RhuA mediated self-assembly conditions (in the presence of 200 1.M ZnCl, 
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presence of 10mM BME at 4°C with gentle shaking) and self-assembly was _ (centre) and upon addition of 10 mM EDTA (bottom). Scale bars are 51m 
monitored by DLS at t=0 (top), t=3 days (centre) and upon addition of in all panels. 
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Extended Data Figure 7 | Characterization of the self-assembly of (bottom panels). c, Possible mode of disulfide-mediated self-assembly 
>°8RhuA, ©?3RhuA and ©76RhuA variants by TEM and DLS. a, TEM of ©76°RhuA (top panels) and TEM characterization of the self-assembly 
(right panels) and DLS (left panels) characterization of P?’RhuA self- products obtained under conditions that were optimized for ©’RhuA 
assembly under oxidative or metal-mediated self-assembly conditions (bottom panels). No crystalline assemblies were detected under these 

that were optimized for “’RhuA; see main text or Extended Data Figs 2 conditions for any of these three variants (or other conditions that were 
and 4 for details. b, Possible mode of disulfide-mediated self-assembly used for screening “*8RhuA self-assembly). Colouring in the top panels of 
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products obtained under conditions that were optimized for ©’°’RhuA 
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No. of images used 43 (12) 50 (44) 33 (13) 
for analysis 


Unit cell dimensions 
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Extended Data Figure 8 | Crystallographic analysis of the 2D lattices of 
RhuA variants. a, (i) Representative Fourier transforms calculated from 
the full field of view of crystal images, displayed up to 10 A. Reciprocal 
lattice axes are indicated with H and K. The reflections are consistent 

with the plane group symmetries estimated from the analysis of the phase 
residuals (see Supplementary Table 4). (ii) Integer quality (IQ) plots 
calculated from the spectra in column (i). The size of the boxes around 
the reflections reflects their IQ value, defined as the ratio of the reflection- 
peak amplitude and the amplitude of the background signal around each 
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peak”. The most significant reflections are labelled with their IQ values, 
1-4. Rings are displayed at resolutions of 30 A, 15 A and 10A. (iii) Overlap 
of fast Fourier transform and IQ plots. b, Crystallographic data for the 2D 
lattices of RhuA variants. Reported are the numbers of TEM images used 
for the analysis of plane group symmetry. The numbers in parentheses are 
the number of images used for determining the statistics for the unit-cell 
dimensions of each lattice. The plane group symmetry was determined by 
consensus from all the available images. c, Electron diffraction patterns of 
©8RhuA and 7%" 8RhuA lattices. 
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Extended Data Figure 9 | 2D TEM reconstruction and structural c, Representative images of 2D ©*8RhuA crystals containing no defects. 
modelling of the 2D lattices of RhuA variants. a, Comparison of the d, Representative images of 2D “8RhuA crystals containing a single 
observed (experimental) and simulated projection maps of the 2D lattices _lattice vacancy. The stain artefact images are shown to highlight areas that 
of RhuA variants. b, Molecular arrangements in ©’RhuA, 2°" 8RhuA may appear to contain lattice vacancies in low-magnification views, but 


and F88/C8RhuA lattices viewed from different angles (colouring as in Fig. 1). actually do not. 
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each conformational state. Red lines represent the border of the pores 
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Iron(111)-catalysed carbonyl-olefin metathesis 


Jacob R. Ludwig!', Paul M. Zimmerman!, Joseph B. Gianino! & Corinna S. Schindler! 


The olefin metathesis reaction of two unsaturated substrates is 
one of the most powerful carbon-carbon-bond-forming reactions 
in organic chemistry. Specifically, the catalytic olefin metathesis 
reaction has led to profound developments in the synthesis of 
molecules relevant to the petroleum, materials, agricultural and 
pharmaceutical industries’. These reactions are characterized by 
their use of discrete metal alkylidene catalysts that operate via a 
well-established mechanism. While the corresponding carbonyl- 
olefin metathesis reaction can also be used to construct carbon- 
carbon bonds, currently available methods are scarce and severely 
hampered by either harsh reaction conditions or the required 
use of stoichiometric transition metals as reagents. To date, no 
general protocol for catalytic carbonyl-olefin metathesis has been 
reported. Here we demonstrate a catalytic carbonyl-olefin ring- 
closing metathesis reaction that uses iron, an Earth-abundant 
and environmentally benign transition metal, as a catalyst. This 
transformation accommodates a variety of substrates and is 
distinguished by its operational simplicity, mild reaction conditions, 
high functional-group tolerance, and amenability to gram-scale 
synthesis. We anticipate that these characteristics, coupled with the 
efficiency of this reaction, will allow for further advances in areas 
that have historically been enhanced by olefin metathesis. 
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Classified by their mechanistic profile, metathesis reactions most 
often proceed through a [2+2] cycloaddition to form a 4-atom cyclic 
intermediate, followed by [2+2] cycloreversion, resulting in an 
exchange of reactive partners*®. Included in this mode of reactivity 
are classical olefination reactions (Fig. 1a), in which highly polarized 
nucleophilic components add to carbonyl derivatives to give cyclic 
intermediates’ ’. These compounds then fragment to produce olefins 
and a metal or main-group oxo species, the latter being the thermo- 
dynamic driving force for the reaction. The utility of this chemistry is 
often hampered by harsh conditions, the need for pre-functionalized 
intermediates and the production of excess quantities of waste 
products. A revolutionary development in this regard was the estab- 
lishment of the catalytic olefin—-olefin metathesis reaction, in which 
transition-metal alkylidenes facilitate [2+2] cycloadditions and sub- 
sequent [2+2] cycloreversions from simple olefin starting materials 
(Fig. 1b). Since its discovery, olefin metathesis has led to marked 
advances in the fields of organic chemistry as well as chemical bio- 
logy, and allowed entry into previously inaccessible complex molecules. 
Although it was discovered at around the same time, the corresponding 
carbonyl-olefin metathesis (Fig. 1c) is much less developed”’. To date, 
no catalytic protocol of general synthetic utility exists for carbonyl- 
olefin metathesis reactions, and, furthermore, available procedures 
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Figure 1 | Olefination and metathesis reactions for the formation 
of alkenes. a, Summary of traditional olefination reactions in organic 
synthesis. b, Olefin-olefin metathesis catalysed by ruthenium, 
molybdenum and tungsten complexes. c, Modes of carbonyl-olefin 
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rely on harsh reaction conditions or the use of stoichiometric metal 
reagents. The earliest reported examples for carbonyl-olefin metath- 
esis reactions involve a two-step sequence including a photoinduced 
[2+2] cycloaddition, followed by a subsequent acidic and/or thermal 
fragmentation step to induce cycloreversion to the corresponding 
metathesis products (Fig. 1c)!°-!*. The success of this photolysis— 
pyrolysis series is often impeded by harsh reaction conditions, which 
are prone to result in competing polymerization or regeneration of 
starting carbonyl-olefin pairs. Importantly, stoichiometric amounts of 
catalysts originally developed for olefin metathesis (Fig. 1b) were found 
to be able to mediate carbonyl-olefin metathesis reactions. In this case, 
the cycloreversion step results in the formation of a kinetically inert 
metal-oxo complex, precluding regeneration of the metal alkylidene 
and prohibiting catalyst turnover (Fig. 1c)”’. 

Alternatively, titanium-based Tebbe and Petasis reagents have 
been used in excess to induce initial olefin metathesis and sub- 
sequent carbonyl olefination reactions, resulting in the formation of 
carbonyl-olefin metathesis products'*!. In addition to this titanium 
alkylidene approach, Bronsted and Lewis acids are also reported to 
effect carbonyl-olefin metathesis reactions of simple acetophenone 
and benzaldehyde derivatives, although limited in scope'®'?. An 
important contribution in carbonyl-olefin metathesis reactions was 
reported recently, wherein an organocatalysed (3+2)/retro-(3+2) 
cycloaddition process allowed the ring-opening metathesis of cyclo- 
propenes to give -y,5-unsaturated aldehydes, which constitute formal 
carbonyl-olefin metathesis products (Fig. 1c)*°. Although a remark- 
able advance, only highly strained cyclopropenes are applicable as 
olefinic components. As such, a general catalytic protocol has yet to 
be developed. Despite the development of the methods described 
earlier, the only protocol for carbonyl-olefin metathesis that has 
been applied to the synthesis of biologically active natural products 
and pharmaceutically relevant compounds”! is the stoichiometric 
metal-mediated approach (Fig. 1c), further illustrating the need 
for a general catalytic system. We describe a Lewis-acid-catalysed 
carbonyl-olefin ring-closing metathesis reaction for the direct 
synthesis of functionalized carbocycles, representing a major advance 
in carbonyl-olefin metathesis. 

At the outset of our investigations, we envisioned an efficient design 
principle for catalytic carbonyl-olefin metathesis based on the in situ 
formation of oxetanes as reactive intermediates (II) (Fig. 1c). This 
approach fundamentally differs from stoichiometric carbonyl-olefin 
metathesis protocols proceeding via intermediate metalla-oxetanes 
using metal-alkylidene reagents. While these result in the formation 
of kinetically inert metal-oxo complexes upon retro-cycloaddition that 
ultimately prohibit a catalytic process, the corresponding oxetane inter- 
mediates (II) are able to undergo direct fragmentation to the desired 
carbonyl-olefin metathesis products, thus enabling catalyst turnover. 
This conceptually simple design principle for catalytic carbonyl-olefin 
metathesis relies on several fundamental requirements to achieve a 
viable synthetic strategy. In particular, a potent catalyst needs to be 
identified that has the capability of efficiently promoting both the 
[2+2] cycloaddition as well as the subsequent [2+2] cycloreversion 
step. Additionally, potential competing side reactions upon [2+2] 
cycloreversion, such as polymerization and regeneration of starting 
materials, need to be reduced. Similarly, the control of chemoselectivity 
is challenging, as unsaturated ketone substrates are prone to undergo 
different modes of reactivity, including carbonyl-ene, Prins and alkyl- 
ation reactions of enolate intermediates**~**. Initially, we hypothesized 
that metal-derived Lewis acids could overcome these constraints 
and function as effective catalysts to activate the carbonyl for both 
initial [2+2] cycloaddition as well as subsequent fragmentation of 
the resulting oxetane to yield the desired carbonyl-olefin metathesis 
products (III; Fig. 1c). However, only isolated literature reports exist 
for Lewis-acid-mediated oxetane formation using either equimolar 
amounts of titanium(1v) or tin(1v)?>’. In comparison, Lewis-acid- 
promoted oxetane fragmentation was found to be more prominent, 
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Figure 2 | Initial evaluation of the catalytic carbonyl-olefin metathesis 
reaction. a, Stoichiometric Lewis acid evaluation. All reactions were 
performed using 0.18 mmol $-ketoester 1, 0.18 mmol Lewis acid I 
dichloromethane (0.1 M) at room temperature (RT) for 24h. b, Evaluation 
of electronically differentiated arene substituents in the catalytic carbonyl- 
olefin metathesis reaction using iron(111) chloride as an Earth-abundant 
Lewis acid catalyst. Conditions: ketone (1.0 equiv.), FeCl; (5 mol%) in 
dichloroethane (0.01 M), room temperature, 1-12 h. 17 and 18 were run at 
reflux in dichloroethane. For 17 and 18, yields at room temperature were 
determined by 'H-NMR analysis using naphthalene as internal standard. 


examples of which rely on boron trifluoride diethyl etherate, alumin- 
ium, and Bronsted acid reagents'®?””8, 

To identify a single catalyst for in situ oxetane formation and frag- 
mentation, we initially investigated the reaction of 3-ketoesters bearing 
a pendant isoprenyl group with various Lewis acids (Fig. 2). When 
AIC]; was used equimolar to 3-ketoester 1, the formation of the corre- 
sponding tertiary chloride 2 was observed, while stoichiometric TiCl, 
resulted in the tertiary alkylation product 3 in 74% yield (Fig. 2a). 
Upon reaction with equimolar amounts of SnCl,, two new products 
were isolated in 47% and 24% yield, respectively. The major product 
of this transformation was identified as the tertiary alkylation product 
3, while the minor product was identified as the desired carbonyl- 
olefin metathesis product 4, thereby providing proof of concept that 
a single Lewis acid is capable of promoting both intermediate oxetane 
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Figure 3 | Scope of the iron(111)-catalysed carbonyl-olefin 

metathesis reaction. Conditions: ketone (1.0 equiv.), FeCl; (5 mol%) in 
dichloroethane (0.01 M), room temperature, 1-12 h. 25 was stoichiometric 
in FeCl;. 41 and 42 were run at 60°C using FeCl; (5 mol%), 12h. 30 was 
obtained as a single alkene isomer in 52% yield using Fe(OTf);3 

(5 mol%) at reflux and as a 1.2:1 mixture of alkene isomers in 46% yield 
using FeCl; (5 mol%) at room temperature. The yield for both reactions 


formation and subsequent fragmentation, albeit in stoichiometric 
amounts. No reaction was observed with substoichiometric amounts of 
protic acids and of a variety of Lewis acids, including FeCl:, ZnCl, and 
SnCl, (Supplementary Table 2). Succeeding investigations identified 
iron(111)-derived Lewis acids as superior reagents capable of selectively 
forming the desired carbonyl-olefin metathesis products. Ultimately, 
we identified 5mol% FeCl; in dichloroethane as the optimal set of 
reaction conditions, resulting in quantitative formation of the desired 
carbonyl-olefin metathesis product (Supplementary Tables 3 and 4). 

The optimized reaction conditions proved efficient for various 
substrates (Fig. 2b). Aromatic B-ketoesters bearing strongly 
electron-donating substituents on the arene subunit were converted 
to the corresponding cyclopentene metathesis products (5-9; Fig. 2b) 
in good yields. In particular, the ortho-methoxy cyclopentene 5 was 
formed quantitatively, while increasing the number of methoxy sub- 
stituents led to diminished yields, with 3,4- and 2,5-dimethoxy arenes 
resulting in the formation of 6 and 9 in 66% and 64% yield, respectively. 
A mixture of para-substituted alkene isomers 7 and 8 was obtained in 
quantitative yield when the corresponding para-methoxy }-ketoester 
was subjected to otherwise identical reaction conditions. 

In contrast, the para-acetylated cyclopentene 10 was obtained 
as a single product in 57% yield. 3-Ketoesters bearing different 
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was determined by 1H-NMR analysis using naphthalene as internal 
standard. 28 and 44 were isolated as a mixture of alkene isomers. See 
Supplementary Information for details. The tetrasubstituted alkenes 

28 and 30 represent the thermodynamically more stable products. For 27, 
yield was determined by gas chromatography analysis using dodecane as 
internal standard. For 43, yield was determined by 'H-NMR analysis using 
naphthalene as internal standard. 


substituents on the arene moiety (that is, alkyl, aryl and unsub- 
stituted) were converted in higher yields of up to 99% to form 
cyclopentenes 11-16 (Fig. 2b). Electron-deficient chloro-, bromo- 
and iodoarenes resulted in the formation of the corresponding 
carbonyl-olefin metathesis products 17-20 (Fig. 2) in high yields. 
Taken together, these data demonstrate tolerance of various electron- 
ically differentiated arenes. 

We next sought to expand the scope of the iron(11)-catalysed 
carbonyl-olefin metathesis reaction by investigating different 
B-substitution, heteroatom incorporation, and varied structural 
motifs (Fig. 3). First, we focused on substrates differing in their 
B-substitution pattern and found isopropyl and benzyl esters to be 
tolerated, resulting in the formation of 21 and 22 in 70% and 67% 
yield, respectively. Notably, ketones, sulfonyl and amide functional 
groups were also compatible (23-25; Fig. 3); however, stoichiometric 
iron was required to induce full conversion in the case of the amide. 
Importantly, bis-phenyl cyclopentene 26 and indene 27, which lack 
3-Lewis basic substituents, are efficiently generated from the metathesis 
reaction. Interestingly, tetrasubstituted olefins 28 and 30 were obtained 
under the optimal reaction conditions as the thermodynamically sta- 
ble products. Naphthyl-substituted 29 was also obtained under the 
optimized reaction conditions. Finally, we explored the generation of 
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Figure 4 | Alkene evaluation in the catalytic carbonyl-olefin 
metathesis reaction. Conditions: ketone (1.0 equiv.), FeCl; (5 mol%) 
in dichloroethane (0.01 M) at room temperature. See Supplementary 


cyclopentenes bearing quaternary carbon centres. Quaternary alkyl, 
allyl, vinyl and benzyl groups led to the formation of the corresponding 
cyclopentenes 31-40 (Fig. 3) in up to 99% yield. Additionally, when 
conducted on a 1.0 scale, cyclopentene 40 was isolated in 98% yield, 
indicating the amenability of this protocol to being used on a larger 
scale. Moreover, structurally complex tricyclic and spirocyclic motifs 
were obtained in excellent yields in a single synthetic transformation 
(35, 36 and 38; Fig. 3). Importantly, substrates bearing double alkyl 
substitution in the a-position proved effective in producing the desired 
metathesis adducts, resulting in the formation of cyclopentene 33 in 
44%, vinyl-substituted thiophene 34 in 76%, and vinyl-anisole 39 
in 80% yield. Notably, the scope of the iron(111)-catalysed carbonyl- 
olefin metathesis was successfully extended beyond the initial substrate 
scaffold to tolerate the sterically demanding bis-substitution in the 
«-position as well as chain elongation in the isoprenyl subunit. This set 
of compounds indicates that a -Lewis basic substituent is not neces- 
sary to bind the catalyst to facilitate carbonyl activation. Furthermore, 
the success of substrates bearing a quaternary centre in the a-position 
demonstrates that the formation of a metal enolate, a process often 
observed upon the combination of Lewis acids and 1,3-dicarbonyls, 
is not necessary to form the metathesis product. Extending the olefin 
chain length by one carbon unit results in the formation of 6-membered 
carbocycles 41-44 in up to 71% yield. 

To gain further insight into the controlling features of this process, 
we next turned our attention to the pendant olefin moiety. As presented 
(see earlier), isoprenyl-derived alkenes, such as 1 (Fig. 2a) provide effi- 
cient conversion to the metathesis product. This aspect led us to ini- 
tially consider the possibility of the formation of a tertiary carbocation 
as a potential intermediate for catalysis. To examine this supposition, 
we explored the effect of varying olefin substitution (45-50; Fig. 4), 
which resulted in the formation of the carbonyl-olefin metathesis prod- 
ucts in all cases in good to excellent yields. However, substrates with 
the potential of benzylic stabilization of an intermediate carbocation 
resulted in an overall decrease in yield compared to 1 (45-50; Fig. 4). 
These data suggest that increased stabilization of a carbocation from a 
simple tertiary centre can hinder product formation. Alternatively, the 
bulkier arene substituents present in olefins 45-50 compared to 1 could 
increase the energy of one or more of the transition states necessary 
for product formation. 
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Information for additional experimental details. Terminal alkenes and 
tetrasubstituted alkenes were found to be unreactive under the optimized 
reaction conditions. 47-49 were run at reflux in dichloroethane. 


Collectively, our examination of the substrate scope has provided 
initial insight into the mechanism of iron(111)-chloride-catalysed 
carbonyl-olefin metathesis. The reaction tolerates various aromatic 
substituents, while sterically encumbered olefin moieties inhibit 
product formation. Moreover, metal enolate formation is not required 
to yield the desired product, since compounds bearing quaternary 
carbon centres in the a-position are viable substrates for this trans- 
formation. Additionally, the formation of acetone as the correspond- 
ing carbonyl by-product was observed upon ‘H-NMR analysis of the 
reaction with olefin 1 (using Sc(OTf)3 as a catalyst), consistent with 
the proposed metathesis process. On the basis of these initial findings, 
we propose two possible pathways of product formation that proceed 
via oxetane E. In a mechanism relying on carbocation intermediates, 
substrate A (Fig. 5) initially coordinates to iron(111) to form adduct 
B. This activated carbonyl can subsequently undergo nucleophilic 
attack by the pendant olefin on either face, resulting in the formation 
of carbocycles C and D. The cis-configuration of carbocycle C allows 
for intramolecular trapping of the carbocation to form oxetane E. 
Importantly, this cyclization is not possible for carbocycle D. The high 
yield of product implies that formation of carbocycle D is reversible 
and that it is possible for carbocycles C and D to interconvert through 
B. Oxetane E can undergo subsequent iron(111)-mediated ring open- 
ing to form intermediate F, containing a benzylic carbocation. This 
species can then eliminate carbonyl H and form the carbonyl-olefin 
metathesis product G. In an alternative mechanism, the carbonyl- 
olefin metathesis reaction can proceed in a concerted fashion (Fig. 5), 
which initially also relies on coordination of substrate A to iron(111) 
chloride to form Lewis acid adduct B. However, adduct B then reacts 
in a concerted [2+2]-cycloaddition to form oxetane intermediate E, 
which upon concerted [2+2]-cycloreversion results in the formation 
of carbonyl-olefin metathesis product G and carbonyl by-product 
H (Fig. 5). 

Our subsequent efforts focused on developing mechanistic probe 
molecules to trap potential carbocation intermediates C and F upon 
reaction with suitable nucleophiles (Fig. 5). When substrate 1 was 
reacted under the optimized reaction conditions with equimolar 
amounts of various nucleophiles (methanol, isopropanol, acetic acid, 
benzoic acid), no products resulting from pathways proceeding via 
intermediate carbocations were observed (51 and 52; Fig. 5). In all 
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Figure 5 | Mechanistic hypothesis for the iron(111)-chloride-catalysed 
carbonyl-olefin metathesis reaction. a, Two possible pathways of product 
formation relying on either a concerted or a carbocation mechanism. 

b, Design of mechanistic probe molecules to trap potential carbocation 


cases, the only product obtained was the desired carbonyl-olefin 
metathesis product 4 in up to 98% yield. When mechanistic probe 
molecule 53 bearing a pendant primary alcohol was reacted, the only 
product isolated in the course of this transformation was the metathesis 
product 54, and no product resulting from potential carbocation 
intermediates was observed (55; Fig. 5). Together, these results point 
towards an iron(111)-catalysed carbonyl-olefin metathesis reaction pro- 
ceeding via a concerted mechanism that does not involve carbocations 
as essential intermediates. 

We further investigated the mechanism of the carbonyl-olefin 
metathesis reaction computationally through the ZStruct method”””®. 
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4.3 kcal mol" 


3.8 kcal mol"! 


Reaction coordinate 


intermediates C and F. c, Enthalpic profile and reaction pathway of the 
computationally proposed reaction mechanism for carbonyl-olefin 
metathesis using iron(11) chloride. Ar, generic aryl substituent; R, R', R’, 
generic substituent. 


This computational tool systematically searches a combinatorial set of 
plausible reaction coordinates for kinetically and thermodynamically 
feasible elementary reactions, using quantum chemistry to provide 
accurate predictions for activation barriers and energies of reactions. 
In the case of structure I (Fig. 5), ZStruct generated approximately 800 
reaction coordinate combinations. Upon complete quantum chemical 
evaluation of these reaction paths, the method successfully identified the 
lowest barrier transition state (K) leading to oxetane intermediate (L). 
This unbiased strategy differs from standard reaction simulations that 
typically require individual, selected mechanistic hypotheses for each 
step of a proposed reaction sequence. 
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The full ZStruct search located one highly favourable reaction path- 
way, outlined in Fig. 5c. Iron(111) chloride initially binds to the car- 
bonyl oxygen in I to enable formation of an oxetane intermediate via 
a concerted [2+2]-cycloaddition reaction with an enthalpic barrier 
of 14.5 kcal mol! above the iron-substrate complex I. This step is 
concerted yet asynchronous, with the carbon-carbon bond formed to 
a greater extent than the carbon-oxygen bond in the transition state. 
The resulting iron-bound oxetane L was found to be 4.3 kcal mol™! 
in enthalpy above the starting complex I. A subsequent elementary 
step with a barrier of 15.3 kcal mol’ allows for ring opening to result 
in the formation of metathesis product N and acetone by-product. 
This reaction step was also found to be asynchronous, with the car- 
bon-oxygen bond breaking to a greater extent than carbon-carbon 
bond cleavage in the transition state. In each step, the carbon-oxygen 
distance is elongated more than the carbon-carbon bond distance 
in the transition state. This asynchronicity is consistent with Lewis 
acid activation of the carbonyl oxygen in I, in which the build-up 
of charge on the carbon-oxygen side of the forming or breaking 
oxetane is compensated through charge withdrawal by iron(111) 
chloride. 

To provide additional support for the computationally proposed 
mechanism, rate-limiting activation barriers for ten substrates were 
evaluated. Seven of these substrates (14, 45-50) were predicted to have 
activation barriers suitable for productive catalysis. Subsequent experi- 
ments showed a range of productivity from 0% yield for the highest 
barrier substrates up to 99% yield for lower barrier cases. These yields 
correlate well with the computational predictions, providing consid- 
erable evidence for the concerted mechanism (Fig. 5). These results 
also corroborate the mechanistic probe experiments (Fig. 5), where the 
concerted mechanism is consistent with the inability for carbocationic 
intermediates to be trapped. 

The catalytic carbonyl-olefin metathesis reaction described here 
enables direct carbon-carbon bond construction between carbonyl 
and olefin moieties relying on iron as an environmentally benign 
catalyst. This new method is operationally facile, applicable to gram- 
scale synthesis and provides a distinct advance towards the develop- 
ment of a sustainable and economical approach for carbonyl-olefin 
metathesis. 
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Changing atmospheric CO, concentration was the 
primary driver of early Cenozoic climate 


Eleni Anagnostou!, Eleanor H. John’+, Kirsty M. Edgar?*+, Gavin L. Foster!, Andy Ridgwell*+”, Gordon N. Inglis®’, 


Richard D. Pancost®’, Daniel J. Lunt*” & Paul N. Pearson? 


The Early Eocene Climate Optimum (EECO, which occurred about 
51 to 53 million years ago)', was the warmest interval of the past 
65 million years, with mean annual surface air temperature over ten 
degrees Celsius warmer than during the pre-industrial period” *. 
Subsequent global cooling in the middle and late Eocene epoch, 
especially at high latitudes, eventually led to continental ice sheet 
development in Antarctica in the early Oligocene epoch (about 33.6 
million years ago). However, existing estimates place atmospheric 
carbon dioxide (CO 2) levels during the Eocene at 500-3,000 parts 
per million®’, and in the absence of tighter constraints carbon- 
climate interactions over this interval remain uncertain. Here we 
use recent analytical and methodological developments®!! to 
generate a new high-fidelity record of CO, concentrations using 
the boron isotope (5''B) composition of well preserved planktonic 
foraminifera from the Tanzania Drilling Project, revising previous 
estimates®. Although species-level uncertainties make absolute 
values difficult to constrain, CO, concentrations during the EECO 
were around 1,400 parts per million. The relative decline in CO, 
concentration through the Eocene is more robustly constrained 
at about fifty per cent, with a further decline into the Oligocene’. 
Provided the latitudinal dependency of sea surface temperature 
change for a given climate forcing in the Eocene was similar to that of 
the late Quaternary period'?, this CO, decline was sufficient to drive 
the well documented high- and low-latitude cooling that occurred 
through the Eocene’. Once the change in global temperature 
between the pre-industrial period and the Eocene caused by the 
action of all known slow feedbacks (apart from those associated 
with the carbon cycle) is removed“, both the EECO and the late 
Eocene exhibit an equilibrium climate sensitivity relative to the pre- 
industrial period of 2.1 to 4.6 degrees Celsius per CO, doubling 
(66 per cent confidence), which is similar to the canonical range 
(1.5 to 4.5 degrees Celsius’), indicating that a large fraction of the 
warmth of the early Eocene greenhouse was driven by increased CO, 
concentrations, and that climate sensitivity was relatively constant 
throughout this period. 

Over the past 540 million years, Earth’s climate has oscillated 
between a globally warm ‘greenhouse state’ and an ‘icehouse state’ 
with substantial continental glaciation’*®. The most recent of these 
transitions occurred between the warmest time interval of the last 
65 million years—the EECO (about 14+3°C warmer than pre- 
industrial times*)—and the rapid growth of ice on Antarctica in the 
earliest icehouse state of the Oligocene (~33.6 Myr ago’). It has been 
suggested that variations in the concentration of the greenhouse gas 
CO) were responsible for both the overall warmth of the Eocene 
and the subsequent cooling’. Recent studies have documented the 
importance of CO, decline for the final step into the icehouse across 
the Eocene-Oligocene transition'”'®. Despite this, the few available 


CO, reconstructions vary markedly between different proxy systems, 
obscuring relationships with the global cooling trend’>!””° and there- 
fore preventing a robust test of this hypothesis (Fig. 1). Furthermore, 
fully coupled climate models require CO, concentrations ranging 
from 2x to 16x the pre-industrial value (560-4,480 parts per million 
(p.p.m.); see, for example, ref. 21) in order to replicate reconstructed 
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Figure 1 | Currently available Eocene atmospheric CO, records and 
benthic foraminiferal §!°O values. a, CO, reconstructions”®!?!*!%° with 
the 2 s.d. uncertainties. b, Benthic foraminiferal 5'%O (ref. 1). The green 
line shows the five-point moving average of the grey line, with data age 
modified to account for the revised timing of the MECO*’. Lower benthic 
5!80 values denote warmer bottom-water temperatures (or less ice volume, 
which is assumed to be minimal in the Eocene). Error bars on §!80 are 
smaller than the symbols (Methods). 
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Figure 2 | Eocene planktonic foraminiferal multi-species stable 

isotope arrays. a—e, Foraminiferal 5'°C versus 6'8O for each time slice. 

f-j, Foraminiferal 6!'B values versus 6'8O inferred water depths for each 
time slice. h shows the anomalous behaviour of Globigerinatheka spp. 
(black diamonds) in both 6°C and 6!'B (see Methods). The blue arrow in j 
shows the correction of T. cerroazulensis 5''B to shallower water values 
(Methods). Shaded areas in f-j surround the species used in Fig. 3a. 

Errors on §!3C and 6'80 are smaller than symbols. Error bars on 6"'B are 2 s.d. 
based on long-term precision (Methods). VPDB, the Vienna Pee-Dee 
belemnite standard. For a full species list see Supplementary Table 1. 


early Eocene sea surface temperatures (SSTs) and mean annual air 
temperatures (MATs). Clearly, to use past warm climate states like the 
early Eocene to evaluate models and our understanding of the climate 
system, new and more accurate CO, reconstructions are needed. 

The use of boron isotopes (5''B) in marine carbonates is a well 
understood proxy of seawater pH, allowing for high-fidelity recon- 
structions of atmospheric CO) in locations where surface seawater is 
in near-equilibrium with the atmosphere regarding CO) (see refs 9 
and 22). Recently, the isotopic fractionation factor of boron in seawater 
has been experimentally determined’®, our understanding of the evolu- 
tion of the ocean carbonate system through time has improved’, and 
it has become apparent through inter-laboratory comparison studies® 
that a potential bias relating to the foraminiferal B/Ca ratio may exist in 
earlier studies, suggesting that published Eocene 6''B-CO, estimates’ 
need to be revisited. Here we resolve these issues by measuring the 
5!'B of well preserved Eocene planktonic foraminifera from the Kilwa 
group in Tanzania using multi collector inductively coupled plasma 
mass spectrometry (MC-ICPMS)’ and a greater array of species, to 
resolve the potential for species-specific and depth-habitat-related 
biases in 6''B more clearly. 

The approach we follow to determine atmospheric CO, from 
foraminiferal calcium carbonate &''B (§''B,) is detailed in the Methods. 
We analysed foraminifera from five discrete time slices between 36.9 and 
53.2 Myr ago recovered by the Tanzania Drilling Project (Extended Data 
Fig. 1), and we quantified the oxygen isotopic composition (6'°O) of up 
to 17 different foraminifera species occupying a range of depth habitats 
to derive the calcification temperature and hence the relative habitat 
depth of the taxa”. In each case we find a decrease of 6''B, with increas- 
ing depth, consistent with modern ocean 8!’ Bporate profiles (Fig. 2) 
and a clustering of the warmest species with a similar 6''B.. 

By applying species-specific §''B—pH calibrations, 6''B, measure- 
ments can be converted to pH reconstructions with depth. However, we 
lack such calibrations for these extinct taxa. The 6''B vital effects that 
are a result of foraminiferal physiology (Methods) appear to be reduced 
in Eocene foraminifera compared to modern taxa for both shallow 
symbiont-bearing and deeper non-symbiotic planktonic foraminifera. 
We therefore calculate pH from 6''B, using two different approaches 
to bracket the probable magnitude of 6''B—-pH vital effects: (1) we 
assume that 6!'B. = &!Brorate (hereafter the ‘borate’ calibration), and 
(2) we assume that the shallower foraminifera in each time slice has the 
6''B-pH calibration of the modern mixed-layer-dwelling foraminifera 
Trilobatus sacculifer”. 

The boron isotopic composition of seawater (61'B,,,) is also required 
to calculate pH, constrained here by two methods, both based on the 
observation that the reconstructed pH gradient from shallow to deep 
for a given 8" Byorate gradient in the water column is a function of 8! Byy 
because of the nonlinearity of the &''B—pH proxy (Extended Data Fig. 2, 
Methods). If we assume 6!'B. = &!!Brorate We Calculate §''B,,, to be in 
the range 38.2-38.7%bo. This is slightly increased to 38.6-38.9%o if we 
use the 6!!B-pH calibration of T: sacculifer (Methods). 

Given the strong positive relationship between 6''B and pH, it is 
evident from our 6''B, data alone that surface water pH increased 
through the Eocene (Fig. 3a). However, having determined both 8! Bwy 
and calcification temperature, we can generate quantitative estimates 
of upper-ocean pH, and here we propagated a conservative estimate of 
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Figure 3 | New atmospheric CO, reconstructions from shallow 
planktonic foraminiferal 5!'B. a, 6!'B of shallower foraminifera 
(symbols as in blue-shaded areas of Fig. 2) and asterisks represent 
the average of those for each time slice. Blue squares and orange 
circles represent the warmest species used in b. Ref. 12 data are offset 
(Supplementary Table 2, Methods). b, Atmospheric CO, assuming 
5''B. =8'' Bporate using the warmest species of each time slice. The red line 
is based on ref. 11. c, Benthic foraminiferal §'%O (ref. 1) (as in Fig. 1b). 
Age error bars (b) are based on the ages of the nearest datums, 5''B errors 
are as in Fig. 2, and the CO, uncertainty is based on the 95% confidence of 
Monte Carlo error propagation (Methods). 


the relevant uncertainties using a Monte Carlo approach (Methods). 
Based on these upper-water-column reconstructions and the borate 
(or T. sacculifer) calibration, we estimate that surface water pH increased 
from 7.66 + 0.06 (or 7.59 + 0.06) at 53.2 Myr ago, to 7.80 + 0.07 
(or 7.76 + 0.06) by 36.9 Myr ago. By extending our knowledge of species 
depth habitat and 6!'B,, to the 5''B, data set of ref. 12 (Methods), it 
is apparent that surface-water pH continued to increase in the early 
Oligocene, reaching 7.88 + 0.05 (or 7.84+ 0.05) by ~33 Myr ago. 
Notably, our Eocene surface-water pH estimates are lower than any 
others reported in the Cenozoic era, including during the warmth of 
the middle Miocene and Pliocene: minimum pH values of 7.83 (ref. 22) 
and 7.99 (ref. 26), respectively. 

Ocean pH, like seawater CO,—that is, [CO2],,—is largely deter- 
mined by the ratio of total alkalinity to total dissolved inorganic 
carbon. Hence, to first order, our pH estimates also indicate that 
[CO] aq declined through the Eocene. Given the oligotrophic setting 
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of our site, this further implies that atmospheric CO) declined through 
the study interval. However, quantitative constraints on early Cenozoic 
CO, evolution require a second carbonate system parameter. Here we 
assume that the surface seawater saturation state for calcite (2...) at our 
site remained between 5.5 and 7.5, a conservative range that encom- 
passes geochemical model estimates”’ (Methods; Extended Data Fig. 3). 
Uncertainty was again fully propagated (details in Methods). 

As expected given the raw 6''B, values and calculated pH, we 
find that CO, decreased (Fig. 3b) for the borate calibration (or the 
T. sacculifer calibration in Extended Data Fig. 4) from 1,400 +470 p.p.m. 
(or 1,900 + 600) p.p.m. during the EECO to <770 p.p.m. (or 920 p.p.m.) 
from 40.3 Myr ago onwards, reaching a minimum of 550 + 190 p.p.m. 
(or 670 +210 p.p.m.) in the early Oligocene. Our new data confirm that 
CO), was most probably > 1,000 p.p.m. (at a 95% confidence interval) 
during the EECO, consistent with the recently revised estimates using 
the sodium carbonate mineral nahcolite’'. Although our younger time 
slices are in good agreement with alkenone-based CO, reconstruc- 
tions (Extended Data Fig. 4), our early Eocene CO; reconstructions 
are clearly unprecedented in the Cenozoic. At 5—7 x the pre-industrial 
value, our reconstructed EECO CO) value can provide tighter 
constraints on models than those that have been previously available. 
Critically, our CO2 reconstructions are lower than is required by a num- 
ber of coupled ocean—atmosphere climate models to achieve a good 
match with Eocene temperature data’’. 

Using our new CO, record, the possible role of CO. in driving 
Eocene cooling can be quantitatively examined. The relative change in 
climate forcing attributable to CO change (AF ¢o,,-vs-rEco) through the 
Eocene can be estimated”® as: 


AF 0, -vs-EECO = 5.32In(C; /Cezxco) + 0.39[In(C, /Crzco) |” (1) 


where Cggco is the atmospheric CO concentration at the EECO, 
and C; refers to the CO, reconstruction at a particular time in the 
Eocene. The CO, forcing based on our reconstructions for the Eocene 
is well correlated with the contemporaneous proxy-derived sea sur- 
face cooling, albeit with considerably larger cooling at high latitudes 
(Fig 4a, Extended Data Fig. 5a, R?=0.8 at low latitudes and 0.9 at high 
latitudes). 

The apparent latitudinal dependency of SST change for a given 
change in radiative forcing has recently been determined for the last 
520,000 years (that is, 520 kyr; ref. 13), using CO, ice-sheet albedo 
(which are slow feedbacks), dust and mean annual insolation as forc- 
ing terms’. Assuming that this latitudinal dependency for a given 
forcing (Fig. 4b and Methods)’ is also appropriate for the Eocene, we 
can calculate the expected SST change as a result of the climate forc- 
ing, AF co, (see Equation (1)), for each target time slice in our study, 
for high latitudes (60° N and 60° S; blue lines on Fig. 4b) and low 
latitudes (30° N to 30°S, red box on Fig. 4b; see Methods) and compare 
these to a recent SST compilation’*. The calculated SST change from 
our CO, reconstructions is within error of the observed, proxy- 
derived SST evolution of both high and low latitudes through the 
Eocene (Fig. 4c, Extended Data Fig. 5c) regardless of which §''B-pH 
calibration is used. This supports CO; as the principal driver of long- 
term Eocene cooling, although feedback processes involved in causing 
the observed polar amplification in Fig. 4a and Extended Data Fig. 5a 
must also play a vital part in determining its pattern. 

Our CO, reconstructions also allow us to examine the sensitivity of 
the Earth’s temperature to changes in radiative forcing. This is often 
discussed in terms of Equilibrium Climate Sensitivity (ECS): the global 
mean surface temperature change (in degrees Celsius) for a doubling 
of CO; after all those feedbacks that operate on the timescale of about 
a century have had time to act (the ‘fast’ feedbacks”). The ECS can be 
calculated for the Eocene using: 


BCS = AT/AF co,-vs-PI x 3.87 (2) 
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Figure 4 | CO, as a driver of latitudinal cooling in the Eocene, and 

ECS analyses of the EECO and late Eocene time slices. a, Evolving 
relationship between SST" for high and low latitudes and the CO; forcing 
of each of our time slices relative to the EECO, with linear regression 

fits and coefficients of determination (R7). b, Apparent latitudinal SST 
sensitivity for the last 520 kyr (ref. 13). The dashed line is the second-order 
polynomial through the SST sensitivity data (grey crosses) of ref. 13. Red 
shading shows the SST sensitivity estimates averaged as a low-latitude 
mean, and blue lines show the high-latitude mean. c, Reconstructed 
(lines) and estimated (symbols) SST relative to 53.2 Myr ago. Symbols are 
estimated using the respective CO reconstructions and the average 
low-latitude (red circles) and high-latitude (blue triangles) SST 


where AT is the global mean surface temperature change compared to 
the pre-industrial period, corrected for the action of those feedbacks 
that operate on timescales longer than a century (the ‘slow’ feedbacks”’) 
(Fig. 4d, Extended Data Fig. 5d); AFco,-ys.py is calculated as in 
Equation (1), replacing Cggco with pre-industrial CO, (278 p.p.m.) 
because we are interested in forcing change relative to the pre-industrial 
period (Fig. 4e, Extended Data Fig. 5e); and 3.87 is the climate forcing 
of a CO, doubling (in units of Wm’), calculated using Equation (1). 

The global mean surface temperature change for the EECO is thought 
to be ~14+3°C warmer than the pre-industrial period, and ~5°C 
warmer than the late Eocene (35 Myr ago; refs 2 and 29). Recent climate 
modelling studies have ascribed 4—6°C of Eocene warmth to changes 
in boundary conditions (such as palaeotopography and continental 
configuration) and the action of the slow climate feedbacks (princi- 
pally the lack of a continental ice sheet on Antarctica and vegetation 
change’). Ifatmospheric CO, concentration were the principal driv- 
ing mechanism for the relative warmth of the EECO and late Eocene, 
we would expect the Eocene ECS value to fall within the range reported 
by the Intergovernmental Panel on Climate Change’, based on climate 
modelling and observations of the (palaeo)climate system. However, if 
other climate feedbacks were important that we have not accounted for 
in our correction of the ‘slow’ feedbacks”, or if the forcing is not due 
to CO, alone, then the calculated ECS value would be different from 
that reported by the IPCC. Full details of how the uncertainties in these 
various parameters are propagated can be found in the Methods, but 
from Fig. 4f and Extended Data Fig. 5f it is clear that the average ECS 
we calculate for the Eocene lies, for the borate (or T. sacculifer) approach 
in the range 2.6-4.6 °C (or 2.1-3.6°C) per CO, doubling (66% confi- 
dence) with maximum probability for the EECO of 3.8°C (or 3.3 °C). 
This range (2.1-4.6°C per CO, doubling) is similar to the IPCC range 
(1.5-4.5°C at 66% confidence’). Therefore, the 65% of early and late 
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sensitivities of b. Bold lines show the reconstructed long-term mean SST 
estimates using the TEXg6 proxy'* at high latitudes (blue >55°) and low 
latitudes (red <30°) relative to the SST ~53 Myr ago. Error bars represent 
full propagation of errors at 95% confidence. d, Range in mean surface 
temperature change AT for early and late Eocene, corrected to remove 
the changes due to slow feedbacks***!”?. e, Forcing compared to the 
pre-industrial period calculated using our CO) reconstructions for the 
time slices 53.2 Myr ago (early Eocene) and 36.9 Myr ago (late Eocene). 

f, Probability density functions of ECS for the early and late Eocene 
compared to IPCC estimates. Shaded bands around lines throughout show 
95% confidence intervals. 


Eocene warmth attributed to fast feedbacks?~*”? is consistent with 


Eocene warmth being driven by higher atmospheric CO, concentra- 
tions. This supports the importance of atmospheric CO) variability in 
determining both the warmth of the early Eocene and the cooling that 
ultimately led to the establishment of the late Cenozoic icehouse. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Site information. To obtain optimal records of oxygen, carbon, and boron 
isotopes, we used well preserved ‘glassy’ carbonate microfossils from the Palaeogene 
of the Kilwa Group material drilled by the Tanzanian Drilling Project*!*. The core 
sites were located on the outer shelf or upper continental slope approximately 
50-70km offshore’, at a palaeolatitude of 19°S, and bathyal palaeowater depths 
of >300 m (ref. 31). Seismic data, sedimentary facies, nannofossil and planktic 
foraminiferal assemblages as well as low concentrations of dinoflagellates and a 
lack of marine biomarkers indicative of high export productivity confirm that these 
sediments were deposited under open-ocean conditions with waters derived from 
the Indian Ocean subtropical gyre** °°, supporting the assumption that [COp].aq 
offshore Tanzania was in (near) equilibrium with the atmosphere with respect to 
CO,. This is supported by the ‘cGENIE’ Earth system model estimates with Eocene 
boundary conditions for offshore Tanzania (Extended Data Fig. 6)°’; neverthe- 
less, to include the uncertainty of potential disequilibrium with the atmosphere 
we added +40 p.p.m. CO, uncertainty into our estimates of atmospheric CO 
using 6!'B. 
Sample preparation. Approximately 3-4 mg of mono-specific foraminiferal mate- 
rial of a narrow size fraction (for example, ~300 individuals from the 212-250,1m 
sieve size fraction) were separated from ~10cm of core material (24cm for the 
time slice 53.2 Myr ago) for tandem analyses of all isotopes (6!8O, 6'°C and &'!B) 
and trace-element composition. Identification of planktonic foraminifera followed 
ref. 35. Each sample was gently crushed and divided into two parts: one for 6''B 
and elemental analyses and another for 6!8O and 6!°C measurements (~100 1g, 
~3% per weight). For the timeslices 53.2 Myr ago and 40.3 Myr ago, Cibicidoides 
sp. were also picked for analyses (~1 mg). Splits for elemental and 6'!B analyses 
were cleaned following established methods***”. Trace-element ratios were deter- 
mined as in ref. 9 and in all cases Al/Ca ratios were <120,.mol mol“? (typically 
<100,1mol mol™'), showing efficient foraminiferal cleaning”. 
5!80-8C analyses, and temperature and relative depth assignments. For all 
5'80 and 61°C analyses, we followed the analytical approach described elsewhere"!, 
and values are reported relative to the Vienna Pee Dee Belemnite (VPDB) standard, 
with 1 s.d. standard external precision of +0.06%o and +0.07%o respectively. 
All temperature reconstructions are based on the equation of ref. 42, corrected 
for changes in global ice volume of —0.8%o*? and palaeolatitude of +0.83%o4 
(following ref. 45). Any further secular trends in seawater 6'80, regional varia- 
tions in the oxygen isotope ratio of seawater, or differing carbonate ion concentra- 
tions are not known with sufficient confidence, so no additional corrections were 
applied. The lowest 6!8O value in each multi-species depth profile was used to 
calculate the maximum SST for each time slice with a conservative uncertainty of 
+2°C that encompasses the differences among variable carbonate §!°O equations 
(ref. 45), and includes potential uncertainties in 6'8O,, (refs 43 and 46). 
Identification of foraminiferal depth habitats is based on 6'8O following refs 25 
and 35. This method could carry large uncertainties in the absolute depth recon- 
structions, but the relative depth habitats of surface and deep taxa are clearly 
distinguishable from the 6'°C versus §'%0 cross plots (see, for example, refs 25, 
45, 47-54), and supported by comparison of §!'B versus 6'8O profiles (Fig. 2). 
Further support for our approach is given by the depth assignment of benthic 
foraminifera Cibicidoides based on 6'8O values at 53.2 and 40.3 Myr ago, which 
at 300-320 m agrees well with the site’s palaeowater depth*’. Any uncertainty in 
the absolute depth assignment of foraminifera as a result of the 4°C range (+2 °C) 
contributes <0.1% uncertainty to the carbonate system calculations and thus is 
not included, although the temperature uncertainty is fully propagated into our 
carbonate system estimates (see below). 
Boron isotope proxy and analyses. Boron isotopes in planktonic foraminifera 
residing close to the seawater surface have been used extensively to reconstruct past 
ocean pH and thus CO) concentrations”””*>~*’. External reproducibility of §!'B, 
analyses is calculated using the approach of ref. 40 and the relationship of ref. 41. 
To reconstruct seawater pH and thus CO) concentrations on million-year 
timescales using foraminiferal §''B values, we must take some steps that include: 
(1) correcting for any vital effects in extinct species; (2) constraining the 6!'B,,, 
which, given the 10-20 Myr ago residence time for boron in the ocean is probably 
different from the modern value (39.61%o)°*?; and (3) estimating a second car- 
bonate system parameter to infer aqueous CO) concentration ([CO>],,) from pH. 
Other necessary steps include (4) reconstructing the calcification temperature and 
relative habitat depth of foraminifera; (5) knowing the CO) disequilibrium between 
surface waters and atmosphere at the study site; and (6) ensuring that primary 6!'B 
values are not altered by diagenesis (mitigated here by using ‘glassy’ foraminifera). 
Vital-effects corrections associated with the boron isotope proxy. Offsets 
between the boron isotopic composition of ambient seawater borate ion (8! Byorate) 
and that of foraminiferal calcite (6!'B.) are known as vital effects. Such vital 
effects develop in response to respiration, calcification and photosynthesis 
generating a pH gradient within the micro-environment (the diffusive boundary layer) 
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around foraminifera®*'. For extant species these effects can be accounted for using 
species-specific §''B-pH calibrations (for example, refs 9, 62-64). However, 
all Eocene foraminifera species are extinct and thus vital effects can only be 
assessed indirectly. To supplement our existing understanding of modern vital 
effects?*76465 we generated core-top (Holocene) planktonic foraminiferal 5!’B, 
5!8O and &'°C values (Extended Data Fig. 7c and Supplementary Table 1) from 
offshore Tanzania (Glow 15, ref. 66). Modern foraminifera from offshore Tanzania 
show a substantially less coherent relationship with '°O than our Eocene samples 
(Extended Data Fig. 7 and Fig. 2). This increased spread is at least partly due to 
the known large vital effects of modern foraminifera, where offsets of 0-4%o are 
required for the 6''B of our modern species to match the 6''B of seawater borate 
(Extended Data Fig. 7c). 

If we apply published modern symbiont-bearing planktonic foraminifera 
5''B-pH calibrations’”°*™ for the shallower Eocene species, and symbiont- 
barren 6''B-pH calibrations of modern planktonic foraminifera*® for the Eocene 
deep dwellers, an unrealistic picture emerges (Extended Data Fig. 8). Specifically, 
there is increased scatter between the shallowest and deepest planktonic foraminif- 
era and inversion in pH gradients with depth. Additionally, there is clear disa- 
greement between Cibicidoides, that we know exhibits minimal vital effect for 
580, 6B and 6C (refs 40, 67), and deeper planktonic foraminifera of similar 
depth (Extended Data Fig. 8), which are otherwise overlapping in 6'B, (for the 
53.2-Myr-ago time slice, S. roesnaesensis §''B = 12.96 + 0.38 %o is in agreement 
with Cibicidoides, which average 13.15 +0.53%o, and for the 40.3-Myr-ago time 
slice, the average Subbotina 6''B = 12.63 + 0.38%o is within uncertainty of 
the Cibicidoides §''B = 12.71 + 0.30%o). This suggests that modern vital effects are 
probably not applicable to Eocene planktonic foraminifera. Given that modern 
vital effects appear to be unsuitable for the Eocene, as explained above, we argue 
by extension that the reduced scatter in the Eocene, and notably the agreement 
in 6''B, among the mixed-layer species targeted in the Eocene (in stark contrast 
with modern values; Extended Data Fig. 7c), suggests a more minor role for vital 
effects in determining 6''B, and therefore that the targeted Eocene foraminifera 
must more closely reflect the 5'’B of ambient borate ion. Furthermore, different 
size fractions of the Eocene foraminifera belonging to the genus Acarinina from 
this study (Supplementary Table 1), and for Early Eocene/Palaeocene Morozovella 
velascoensis and A. soldadoensis in ref. 68 have §''B, compositions within ana- 
lytical uncertainty. This contrasts with the modern symbiont-bearing planktonic 
foraminifera T: sacculifer and Globigerinoides ruber that exhibit large changes in 
5''B with size (1%o to 2.3%o; refs 64, 69), again supporting our assertion that vital 
effects are reduced in the Eocene foraminifera. 

Although we do not yet have a full mechanistic understanding of the causes 
of 6"'B vital effects in planktonic foraminifera, a reduction in vital effects for the 
Eocene foraminifera would most probably result from a reduction in the relative 
magnitude of one or more of the following physiological processes: calcification, 
photosynthesis or respiration. For instance, for smaller modern foraminifera the 
magnitude of the influence of symbiont photosynthesis on the pH of the diffu- 
sive boundary layer is reduced, with smaller-size-fraction foraminifera recording 
6!'B. closer to that of seawater borate®®. Here, the size of our shallower Eocene 
foraminifera is mostly within 212-250,m, which is reduced compared to the 
modern foraminifera analysed here (300-355 1m) and elsewhere (>425 pm; 
ref. 70). Also, vital-effect offsets are pH-dependent (see Extended Data Fig. 9) for 
symbiont-bearing foraminifera *”!. Eocene symbiotic planktonic foraminiferal 
composition is 15.5-14.5%o, and for this range the T. sacculifer (300-355 1m; 
ref. 22) calibration has a minimal effect on pH estimates, as it requires only a 
0.1-0.3%o correction on 6''B, for deriving 5! 'Bporate. 

Importantly, while the modern symbiont-bearing foraminifera calibrated to 
date have &''B composition offset from the seawater borate curve, only §''B-pH 
calibrations for three dinoflagellate hosting species have so far been published. 
Our new depth profile for the modern Indian Ocean shown in Extended Data 
Fig. 7c shows that some extant species (Globigerinella siphonifera and Globorotalia 
menardii) have minimal §!'B, offsets from seawater borate 6B. 

Additional support for minimal vital effects in Eocene foraminifera is also given 
by the relative agreement with independently calculated Eocene CO.>'»? when 
the borate 5!’B-pH calibration is used (Extended Data Fig. 4). Nonetheless, we 
also apply the modern T. sacculifer calibration, because this provides an upper 
limit on the likely extent of &''B vital effects in Eocene foraminifera. Notably, our 
calculated pH and CO, estimates for both approaches are largely within uncertainty 
(Extended Data Fig. 4), highlighting that such assumptions have little influence on 
the conclusions drawn (Fig. 4 and Extended Data Fig. 5). 

Constraints on 5!!B,,,. Geochemical box models”® and existing records of benthic 
foraminiferal 6''B,, coupled with assumptions regarding deep-water pH evolution”, 
indicate that §!! By is unlikely to have changed from its modern value by more than 
2.5%o since the EECO (with a likely maximum rate of change of 0.1% per million 
years)°®. Here, we use new, empirically constrained, upper and lower bounds on 
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Eocene 5!!B,,, from several independent approaches. The underlying methodology 
for quantifying a maximum and minimum 6!'B,, is based on the observation that 
the 6''B versus pH is nonlinear (Extended Data Fig. 2), and its curvature is affected 
by §''B,3 when 6!'B,y is higher, for any given 6'!B, gradient the reconstructed 
pH change is larger than when 6!'B,,, is lower. Furthermore, the curvature in the 
5'!B-pH relationship results in there being a pH below which the 6!'B system 
becomes unresponsive. The lowest foraminiferal §''B, recorded in this study there- 
fore provides the first constraint on the 6!'B,,,, which then places an additional 
upper bound on 6!'B,,,. The lowest 6''B. values we recovered are for Subbotina 
crociapertura of 11.54+0.29%bo at 45.6 Myr ago, and S. eocaena of 11.97 +0.22%o 
at 40.3 Myr ago. If §!'B,, > 39.5%o, these samples do not give a valid mathematical 
solution for the 6!'B versus pH relationship, so Eocene 6!!B,,, must be lower than 
the modern (39.6%o; ref. 59). 

The lower bound for 6!'B.y is constrained using the modern pH gradient 
(corrected for anthropogenic dissolved inorganic carbon) within the top ~300 m 
of the water column in the tropics and assuming that this represents the probable 
minimum pH gradient in Eocene. This assumption is justified by (1) a posi- 
tive linear relationship between 5°C and pH in modern” and Eocene seawater 
(Extended Data Fig. 10 based on cGENIE output”°), and (2) larger 5'°C gradients 
between surface and deep-dwelling Eocene planktonic foraminifera than in the 
modern because of enhanced metabolic rates due to warmer ocean tempera- 
tures’’. As a result, Eocene pH gradients were most probably larger than the 
modern equivalent and therefore our approach is conservative. Modern tropics, 
in non-upwelling regions, have an average pH gradient of the first 300 m equal to 
0.17 (ref. 74) with an anthropogenic CO, correction resulting in an uncertainty 
of 0.003 pH units (GLODAP v1.1; ref. 75). The 53.2-Myr-ago timeslice is used 
for this treatment because it has full upper-water-column depth coverage and 
the smallest &''B. gradient (hence it will provide the smallest pH difference and 
thus the minimum 6"'B,, value). A Monte Carlo propagation of uncertainties 
in pH, temperature, and composition of seawater results in the pH difference 
(assuming 6''B. = &''Byorate) between the shallowest and the deepest planktonic 
dwellers (ApH) at 53.2 Myr ago falling below 0.17 (the modern equivalent) when 
5" Bsw < 38.2%o (with 86% of simulations resulting in >0.17 ApH). Therefore 
the minimum &!'B,, is calculated at 38.2%o for the whole Eocene. This exercise 
was repeated using the alternative methodology for dealing with 5!'B vital effects 
(assuming §!'B. = 6! Bporate for the deeper-dwelling non-symbiotic foraminifera, 
which have 6'!B, within error of co-occurring Cibicicoides, and assuming that 
the shallowest foraminifera has the 6!'B-pH calibration of modern T. sacculifer). 
The minimum constraint from 86% of potential Monte Carlo solutions is 
then 38.6%o. 

To further constrain the maximum Eocene 6!!B,, we used restrictions offered 
by estimates of Apparent Oxygen Utilization (AOU), adapted from previous 
approaches’. AOU describes the amount of dissolved oxygen consumed by the 
remineralization of sinking organic carbon between a selected depth and sat- 
urated surface waters. The surface seawater oxygen concentration—that is, 
[O2.<w] sat depends on SST as a result of Henry’s law. Here, 6'8O-derived SSTs 
are ~30-33 °C (Supplementary Table 1), so corresponding [O2-..]sat values are 
~191-182molkg™! (ref. 76), with potential mean oversaturation of ~13,:molkg~! 
(low-latitude maximum: 0.2-0.4 ml per litre’”’°). To calculate the upper estimate 
of 6!'B,,,, the minimum SST of the Eocene time slices is selected, and thus the 
maximum [O}.sw]sat is calculated to be 204 umolkg~!. Additionally, we assume 
Eocene planktonic foraminifera have an oxygen minimum tolerance thresh- 
old of 44.7 pmol kg™!, analogous to the low-oxygen-accustomed Hastigerinella 
digitata” in the modern Monterey Bay (California). Therefore, we assume Eocene 
deeper-dwelling planktonic foraminifera were bathed in waters with oxygen 
concentrations equal to or higher than the minimum oxygen level of H. digitata 
(AOU = 204 nmol kg~! minus 44.7 pmol kg~! = 159.3 zmolkg~'). To calculate 
AOU for each time slice studied, we modified 6"'B,,, (similarly following the 
Monte Carlo approach explained above) from the minimum constraint of 38.2%o 
to modern seawater (39.6%o; ref. 59). We then first calculate the difference between 
the calculated (initially assuming that §'’B. = 8" Byorate) dissolved inorganic carbon 
at the depth occupied by our deepest-dwelling foraminifera (average), and compare 
this to the dissolved inorganic carbon of the shallower-dwelling (within ~50m 
of the surface on average) planktonic foraminifera. The difference in dissolved 
inorganic carbon is then transformed into AOU using Redfield ratios (106:138, 
CO :O2). The AOU is <159.3 pmol kg~! when &''B,y is at maximum 38.5%o for 
the 45.6-Myr-ago time slice and 38.7%o for the 40.3-Myr-ago time slice, resulting 
in 86% of potential solutions with our Monte Carlo approach satisfying the AOU 
constraint. The same approach was used applying the T. sacculifer calibration 
for the shallowest planktonic foraminifera, resulting in a maximum 6!’ Byy of 
38.8% and 38.9%o for the 45.6-Myr-ago and 40.3-Myr-ago time slice respectively. 
To be conservative, we use the maximum of the two estimates for each calibra- 
tion method. In summary, although we are calculating the lower 5''B,,, bound 


at 53.2 Myr ago and the upper at 40.3 and 45.6 Myr ago, we apply this range across 
the whole of the study interval (53.2-36.9 Myr ago). 

Second carbonate parameter. After calculating seawater pH using 5''B,y and 
5'!B., an additional carbonate parameter is required to calculate CO, concen- 
trations at any given seawater salinity and temperature. Here the second param- 
eter we use is the saturation state of calcium carbonate (surface (2.4. = [CO377]/ 
[CO37~ ].aturation). Typically, for the pre-industrial tropical surface ocean (2-aic is ~6, 
while for Eocene Tanzania waters it is ~7 (ref. 27; Extended Data Fig. 3). In support 
of the narrow range of potential {2.,),, a variety of modelling studies of the early 
Cenozoic show that surface water 2.1, remains within +1, essentially constant in 
the early Cenozoic, independent of model boundary conditions”**° *”, Therefore, 
here we assume surface {2.1- was equal to 6.5 + 1 throughout the Eocene. 

Monte Carlo pH-CO,; estimates from planktonic foraminiferal 5''B,.. 
Atmospheric CO) was calculated using a Monte Carlo approach to solving the 
relevant carbonate system equations with 10,000 iterations, deriving mean, upper 
and lower bounds of 95% of the simulations. We use the following constraints 
for our calculations: average seawater composition for [Ca], =17mmolkg™' 
(refs 83-87), [Mg].w =38 + 3mmolkg (ref. 84) (see also Supplementary Table 2), 
salinity 35 +2, and temperature +2 °C (with temperature derived from §'°O of 
the shallowest-dwelling species). All simulations are iterated assuming a Gaussian 
distribution of these parameters within the stated 2 s.d. A similar distribution is 
also applied to the 6''B, of the warmest and thus shallowest-dwelling species for 
each timeslice (Supplementary Table 2). We did not have sufficient shallow dwell- 
ers in the 36.9-Myr-ago time slice for boron isotope work. Thus, the intermediate 
dweller Turborotalia cerroazulensis was used. T. cerroazulensis was also analysed in 
the 40.3-Myr-ago time slice, and its §''B, and 6'8O offset (and uncertainty) from 
Morozovelloides lehneri (the shallowest dweller in the same timeslice) was applied 
to estimate the shallowest temperature at 36.9 Myr ago (Supplementary Table 2). 
The temperature offsets between species beyond the +2 °C uncertainty applied to 
our calculations are considered unlikely. Note that a Gaussian distribution is not 
applicable to §!'B,y because there is equal likelihood that it falls between the min- 
imum and maximum constraints; we therefore applied a uniform ‘flat’ probability 
5'' Baw for the Monte Carlo simulations. The relative magnitude of each component 
of our error propagation is shown in Extended Data Fig. 4. 

Correction to pH-CO, estimates from T. ampliapertura §''B,. We com- 
pared 6!8O measurements of the surface dwellers Planoglobanomalina and 
Pseudohastigerina in the Eocene and early Oligocene***® with that of T. amplia- 
pertura used in ref. 12, and confirmed that T. ampliapertura gives a slightly cooler 
temperature than Pseudohastigerina. On average, when contemporaneous shallow- 
dwelling planktonic foraminifera are available, T. ampliapertura are offset by about 
+0.3%o in 6'8O and therefore occupy a shallower water depth than T: cerroazulensis 
(whose 6'8O value is consistently ~0.5%o higher than contemporaneous Eocene 
shallow-dwelling planktonic foraminifera). Further, considering the 5''B offset 
between T! cerroazulensis and the shallow-dwelling M. lehneri in the 40.3-Myr-ago 
time slice, we apply a +1%o correction to T: ampliapertura §''B values of ref. 12 to 
estimate surface seawater values. 

Climate sensitivity calculations. Our new CO, reconstructions allow us to exam- 
ine their role in driving long-term Eocene cooling. Climate forcing compared to 
the early Eocene timeslice at 53.2 Myr ago is calculated from equation (1)”* for our 
two CO) scenarios. We then assume that the apparent latitudinal SST sensitivity 
for the Eocene was similar to that of the last ~520 kyr (ref. 13). Estimates of SST 
change per Wm ” forcing for each degree of latitude were generated using the 
second-order polynomial fit through the data of ref. 13 (dotted line in Fig. 4b, 
Extended Data Fig. 5b). As a high-latitude SST sensitivity we used the average of 
estimates for 60° N and 60°S (1.63 K per Wm ”); as a low-latitude estimate we used 
the average of values between 30°S and 30°N (0.36 K per Wm ”). We use the ratio 
of high- or low-latitude SST sensitivity and forcing for each timeslice to calculate 
the SST change expected as a result of our reconstructed CO, forcing. The product 
gives the estimated SST change of Fig. 4c and Extended Data Fig. 5c for high (blue 
symbols) and low (red symbols) latitudes driven by CO). The uncertainties (Fig. 4c, 
Extended Data Fig. 5c) are the product of 1,000 realizations of each CO estimate 
within its uncertainty envelope (95% confidence intervals). 

To test the importance of CO, to early and late Eocene warmth, we calculated 
the sensitivity of the Earth system to radiative forcing, the ECS. For AT we use 
global mean surface temperature for the EECO and late Eocene, where the EECO 
AT is thought to be ~14+3 °C warmer than the pre-industrial global mean sur- 
face temperature, and ~5 °C warmer than the late Eocene global mean surface 
temperature (35 Myr ago)””’. Of this AT, 4-6 °C is attributed to changes in boundary 
conditions (palaeotopography and continental configuration of the Eocene) and 
the action of the slow climate feedbacks (mainly the lack of a continental ice sheet 
on Antarctica and vegetation change’ “), for both the early and late Eocene. The 
uncertainties in Fig. 4d to 4f (and Extended Data Fig. 5d-f) are the products of 
1,000 realizations of the 53.2-Myr-ago and 36.9-Myr-ago CO) values and the AT 
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estimate based on randomly sampling each variable within its 95% confidence 
interval uncertainty envelope. The uncertainty for the CO, and AT estimates fol- 
lows a Gaussian distribution, and for the slow-feedback correction on AT follows 
a uniform ‘flat’ probability. 
Data. The underlying data of this manuscript may be found in the Supplementary 
Tables 1 and 2 (refs 91-94). 
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Extended Data Figure 1 | Palaeogeography and 5'%O-derived temperature depth zero for each time slice. Note that depth assignments are approximate 
against foraminiferal calcification depth. a, Approximate palaeoposition (see Methods). Also shown are the reconstructed temperatures from core- 
of Tanzanian Drilling Project (TDP) sites studied here (map generated from top (Holocene) planktonic foraminiferal §'8O (Supplementary Table 1), 
www.odsn.de). b, Reconstructed temperature (Tgocene) and relative depth where G. ruber is assigned a depth of zero and the rest of the planktonic 
of each foraminifera within each time slice. The pale blue line represents foraminifera are offset to reproduce the measured temperature profile of 
the output of a General Circulation Model simulation run with Eocene the GLOW15 cruise (light blue/green line)°”**. For Holocene temperature 


boundary conditions®””’, whereas the other coloured lines show the General _ reconstructions we used the modern site latitude of 9°S, and SOx, 
Circulation Model output offset to intersect with the warmest temperature at (using the Standard Mean Ocean Water (SMOW) standard) of 0%o. 
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Extended Data Figure 2 | 5!'B versus pH as a function of 5'!B,,. Increasing 5''B,,,, as indicated for each line in units of per mil, results in lower pH for 
the same 6''B. However, for the same 5"'B range, the reconstructed pH range is larger for higher 5''B,,, (see brown-shaded regions). 
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Extended Data Figure 3 | cGENIE estimates of calcite saturation in 
surface waters. Comparison of calcite saturation 2). for pre-industrial 
times (PI; blue) and 55 Myr ago (red) at different latitudes. 
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Extended Data Figure 5 | CO, as a driver of latitudinal cooling in the 
Eocene, and ECS analyses of the EECO and late Eocene time slices. 
The case for the T. sacculifer calibration applied to shallowest planktonic 
foraminifera. a, Evolving relationship between SST" for high (blue) and 
low (red) latitudes and the CO, forcing of each of our time slices relative 
to the EECO. Linear regression fits and coefficients of determination 

(R*) are also shown, with the 95% confidence interval (shaded bands). 

b, Apparent latitudinal SST sensitivity for the last 520 kyr (ref. 13). The 
dashed line is a second-order polynomial through the SST sensitivity 
data (grey crosses) of ref. 13, and the grey lines show the 95% confidence 
interval. A red rectangle surrounds the SST sensitivity estimates averaged 
as a low-latitude mean, and the blue line indicates the high-latitude mean 
(see text and Methods). c, Reconstructed (lines) and estimated (symbols) 
SST relative to 53.2 Myr ago. Symbols show each of our time slices, 
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calculated using the respective CO, reconstructions and the average low- 
(red) and high- (blue) latitude SST sensitivities of b. Bold lines and shaded 
uncertainty band (at 95% confidence) show the reconstructed long-term 
mean SST estimates using the TEXg¢ proxy at high (blue colour >55°) and 
low (red colour <30°) latitudes!‘ relative to the SST ~53 Myr ago. Error 
bars represent full propagation of errors at 95% confidence”. d, Range in 
mean surface temperature change for early (green) and late (black) Eocene 
corrected for changes due to slow feedbacks**?!?. e, Forcing compared 

to the pre-industrial period, calculated using our CO) reconstructions for 
the time slices 53.2 Myr ago (early Eocene) and 36.9 Myr ago (late Eocene) 
(see Methods). f, Probability density functions of ECS for the early (green) 
and late (black) Eocene compared to IPCC estimates (dashed lines show 
the 95% confidence interval (solid pink line)). 
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Extended Data Figure 6 | The cGENIE estimates of air-seawater CO, disequilibrium. The colour scale shows the difference between pCO, in air and 
pCO, in sea water. The model uses Eocene boundary conditions and positive values mean that seawater is a source of CO, (in parts per million); the star 
shows the palaeo-location of Tanzania. 
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Extended Data Figure 7 | Comparison of Eocene and modern 
planktonic foraminiferal 5!!B and 6!3C with 5180. a and b show 
analyses from the time slices 53.2 Myr ago and 40.3 Myr ago, respectively 
(as in Fig. 2 and Supplementary Table 1). Cibicidoides species (Cibs) are 
shaded in blue. c, Core-top (Holocene) offshore Tanzania foraminiferal 
measurements. Seawater §!'Byorate and 6!8O was calculated from 
temperature, alkalinity and dissolved inorganic carbon measurements 
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Extended Data Figure 8 | Reconstructed pH using different 
combinations of published symbiont-bearing and non-symbiotic 
foraminiferal vital-effect calibrations for the two most complete with 
depth time slices. a shows the case where no vital effect corrections were 
applied for comparison. The vital-effect corrections for deeper asymbiotic 
planktonic foraminifera are based on either Globigerina bulloides”® 

(b and d) and Neogloboquadrina pachyderma® (c and e) calibrations. 


e * 53.2Ma 
* 40.3 Ma 
@ 53.2 Ma (Cibs) 
X 40.3 Ma (Cibs) 


For shallow symbiont-bearing planktonics we used calibrations specific 

to modern shallow, symbiont-bearing Trilobatus sacculifer” (b and c) and 
Orbulina universa® (d and e) (as recalculated by ref. 64 and offset by —3%o 
to account for analytical differences between negative thermal ionization 
mass spectrometry and MC-ICPMS instrumentation®”’). For comparison 
we also show the pH reconstructions for the case where we assume 
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Extended Data Figure 9 | The effect of seawater composition on boron isotope calibrations in foraminifera. The example of T: sacculifer calibration” 
and the ‘borate’ calibration (assuming 6''B, = §!'Bporate) for modern and Eocene seawater compositions; see Methods and Supplementary Table 2. 
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Repeated large-scale retreat and advance of Totten 
Glacier indicated by inland bed erosion 


A.R. A. Aitken!, J. L. Roberts*?, T. D. van Ommen?”, D. A. Young’, N. R. Golledge®®, J. S. Greenbaum’, 


D. D. Blankenship? & M. J. Siegert’ 


Climate variations cause ice sheets to retreat and advance, raising or 
lowering sea level by metres to decametres. The basic relationship 
is unambiguous, but the timing, magnitude and sources of sea- 
level change remain unclear; in particular, the contribution of 
the East Antarctic Ice Sheet (EAIS) is ill defined, restricting our 
appreciation of potential future change. Several lines of evidence 
suggest possible collapse of the Totten Glacier into interior basins 
during past warm periods, most notably the Pliocene epoch! *, 
causing several metres of sea-level rise. However, the structure 
and long-term evolution of the ice sheet in this region have been 
understood insufficiently to constrain past ice-sheet extents. Here 
we show that deep ice-sheet erosion—enough to expose basement 
rocks—has occurred in two regions: the head of the Totten Glacier, 
within 150 kilometres of today’s grounding line; and deep within the 
Sabrina Subglacial Basin, 350-550 kilometres from this grounding 
line. Our results, based on ICECAP aerogeophysical data, demarcate 
the marginal zones of two distinct quasi-stable EAIS configurations, 
corresponding to the ‘modern-scale’ ice sheet (with a marginal zone 
near the present ice-sheet margin) and the retreated ice sheet (with 
the marginal zone located far inland). The transitional region of 
200-250 kilometres in width is less eroded, suggesting shorter-lived 
exposure to eroding conditions during repeated retreat-advance 
events, which are probably driven by ocean-forced instabilities. 
Representative ice-sheet models indicate that the global sea-level 
increase resulting from retreat in this sector can be up to 0.9 metres 
in the modern-scale configuration, and exceeds 2 metres in the 
retreated configuration. 

Satellite-based observations indicate that the margin of Totten 
Glacier may be experiencing greater ice loss than anywhere else in 
East Antarctica”®. This, coupled with the presence of low-lying sub- 
glacial basins upstream’*, means the Totten Glacier catchment area 
could be at risk of substantial ice loss under ocean-warming conditions. 
The vulnerability of this region to change might be driven by the 
entrance of warm modified circumpolar deep water to the Totten Ice 
Shelf cavity®'°, for which bathymetric pathways are known!’. 

Totten Glacier possesses an ice stream that extends far inland’?. 
This ice stream mostly overlies the Sabrina Subglacial Basin (SSB), 
which is a bowl-shaped depression bounded to the east by the Terre 
Adelie highlands, and to the west and south by the recently discovered 
Highlands B and C (Fig. 1). This region has widely distributed sub- 
glacial hydrology’® and little large-scale topographic relief*'*'*. The 
SSB is underlain by a sedimentary basin (the SSSB) with moderate but 
quite variable thickness (Fig. 1c). The SSSB probably dates back to at 
least 40 million years ago (Ma)'”, and therefore pre-dates the glaciation 
of the EAIS. 

Here we use geophysical data from the International Collaborative 
Exploration of the Cryosphere through Airborne Profiling (ICECAP) 
program (Extended Data Fig. 1) to define the erosion of the SSB by past 


EAIS activity. We use two-dimensional gravity modelling along flight 
lines, and also include estimates of depth to magnetic sources, to under- 
stand the thickness of sedimentary rocks in the SSSB (Fig. 1c). The 
thickness of the SSSB is defined with a typical accuracy of +600 metres. 

The gravity-model results show that the base of the SSSB sedimen- 
tary basin is tilted towards the south-southeast and extends south- 
east from the Totten Glacier for more than 500 kilometres. Its base is 
as deep as —4,000 metres elevation in parts, but is typically shallower 
(Fig. 1b). The pre-erosion basin structure is not uniquely defined, as the 
basin top is missing, but the least-eroded areas of the SSSB are at least 
3 kilometres thick and have a generally flat-based geometry. Faults 
define smaller-scale perturbations to this geometry. The Aurora 
Subglacial Basin (ASB) and Vincennes Subglacial Basin (VSB) regions 
have much larger thickness uncertainties, and we have not interpreted 
erosion patterns within these basins. 

Thickness trends for the SSSB (Fig. 1c) do not parallel the tectonic 
structure of the region (Extended Data Fig. 2). Thickness variations 
are high within tectonic blocks, and thickness trends transgress major 
tectonic structures'»’°, This suggests that present-day basin thickness is 
not dominated by tectonic structure; rather, the SSSB thickness defines 
distinct patterns that are explained by the erosion of the SSB by the 
EAIS, 

Glacial erosion occurs because of basal sliding, which requires warm- 
based ice and a driving stress that is sufficient to cause motion'’”. The 
erosion rate depends primarily on the basal velocity of the ice sheet!’ 
and because there is the potential for high velocities at the ice-sheet 
margin, so there is also the potential for enhanced erosion rates at the 
margin. Erosion may be selective or distributed. Under selective ero- 
sion, deep troughs occur in regions with high basal velocity’, but a 
lateral convergence of ice flux is also required, often focused within 
pre-existing valleys'®. Thinner ice promotes selective erosion because 
highlands are often cold-based and do not exhibit the fast basal veloci- 
ties necessary for substantial erosion; the same highlands may be warm- 
based under thicker ice cover, and may exhibit fast basal velocities. As 
a consequence, larger and thicker ice sheets are more likely to exhibit 
distributed erosion, whereas smaller and thinner ice sheets are more 
likely to exhibit selective erosion”. 

For the SSB we interpret several regions of distinctive erosion 
(Fig. 1d) on the basis of the morphology of the subglacial surface and 
the thickness of the SSSB (Extended Data Table 1). These regions define 
erosion from two distinct quasi-stable EAIS configurations: a ‘modern- 
scale’ configuration, with a marginal zone near the present-day ice-sheet 
margin; and a ‘retreated’ configuration, with a marginal zone located 
far inland. 

Cumulative glacial erosion is generally low in elevated coastal 
regions, including Law Dome, the Knox Coast and the Cape 
Goodenough region, and erosion is also generally low on the high- 
lands surrounding the SSB and ASB. Ice-sheet models suggest that these 
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Figure 1 | Interpretation of erosion in the Sabrina Subglacial Basin 
region. The Sabrina Subglacial Basin (SSB) is the low-lying region 
bounded by Highland B, Highland C and the Terre Adelie highlands. The 
SSSB is the associated sedimentary basin of moderate thickness. Much 
thicker sedimentary basins occur in the Aurora Subglacial Basin (ASB) 
and Vincennes Subglacial Basin (VSB) regions. a, Base-ice elevations 
indicate the ice-sheet bed. Data are from ICECAP and Bedmapz? (ref. 7) 
(the latter data have a muted appearance). b, c, The sedimentary-basin 
base elevation (b) and thickness (c), both derived from gravity modelling. 


highlands generally either are covered with slow-moving ice, or are ice 
free, with little scope for fast-moving ice (Fig. 3). 

Region A surrounds the coastal glaciers, and is a broad region of 
moderate elevation, cut by a number of smaller channels that align 
with modern ice-sheet flow'*. The SSSB thickness is nil for much of 
region A and rarely exceeds 1 kilometre (Fig. 1c). The A/B boundary 
is located near a topographic ridge that extends west-southwest from 
Cape Goodenough. This ridge averages about —200 metres of elevation, 
but is cut by channels as deep as —800 metres. The ice-sheet bed within 
region A is ocean sloping, with an average gradient (measured along 
flight lines) of +2 to +4mkm!. 

Region B is defined by an almost linear inland-thickening trend to 
the base of the SSSB, with an upstream limit defined where this thick- 
ening trend ceases (Figs 1 and 2). In region B, the thickness of the SSSB 
increases in co-variance with the reduction in surface ice-sheet velocity 
(Fig. 2 and Extended Data Figs 3-6). Non-selective and inland- 
reducing erosion, coupled with the correlation with present-day surface 
velocity, points to prolonged activity of the modern-scale ice sheet. 

Under present climate conditions, ice-sheet modelling indicates high 
basal velocities within region A (Extended Data Fig. 7a). With an air 
temperature anomaly, dT, of +4°C and an ocean temperature anomaly, 
dT, of +1°C (anomaly is with respect to present-day temperatures), a 
retreat of the ice-sheet margin to the A/B boundary is indicated, with 
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Hairlines in c show flight lines. d, We interpret regions of differing 
erosion characteristics for the SSB from the elevation and character of 
subglacial topography and from variations in the preserved thickness of 
the sedimentary basins (Extended Data Table 1). Heavy solid and dashed 
lines show the A/B and B/C boundaries, respectively. Thin lines show 
the interpreted boundaries of the dendritic erosion pattern for regions 

C and B1. Thin dotted lines indicate the Cape Goodenough Ridge. The 
inset shows the location of the region within Antarctica. MUIS, Moscow 
University Ice Shelf; VF, Vanderford Glacier. 


high basal velocities occurring mostly within region B (Fig. 3a). The 
results of these models suggest that repeated small-scale retreat and 
advance cycles within region A are necessary to explain the observed 
erosion. These cycles had a length scale of less than 200 kilometres, did 
not involve collapse, and were probably orbitally forced’. 

Region B comprises two subregions. Subregion B2 preserves 
only the modern-scale ice-sheet-erosion signature, and the SSSB is typi- 
cally more than 2 kilometres thick. In subregion B1, the SSSB is typically 
thinner and includes branched channels in addition to the modern- 
scale ice-sheet signature (Fig. 1). The ice-sheet bed in subregion B1 is 
beneath sea level and typically inland sloping, with an average gradient 
of —1to -3mkm'. 

Retreat into the SSB from the topographic ridge at the A/B boundary 
is subject to ocean-driven instabilities. These include the marine 
ice-sheet instability, moderated by ice-shelf buttressing”®?', and 
ice-cliff failure augmented by hydrofracturing*. The former mechanism 
is widely considered to be the main driver of ice-sheet collapse*’. The 
latter mechanism may yield rapid and extensive retreat, although fur- 
ther work is required to verify its role in large-scale ice-sheet collapse’. 

Following retreat into the SSB interior, a new ice-sheet margin is 
established in front of Highlands B and C. Region C (Fig. 1) is character- 
ized by sufficient erosion to expose basement, and an overall dendritic 
pattern. Patches of thicker sedimentary rocks exist between channels. 
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Figure 2 | Gravity model along flight line RO6Ea. a, Observed and 
calculated gravity disturbances, including model components from ice 
and topography, the deep crust and Moho, and the sedimentary basins. 
b, Results of the model, showing the thickness of sedimentary rocks 
required to satisfy the gravity field. Estimates of depth to magnetic 
source’, the tensioned spline fit, and cross-ties with other models are 
also shown. The gravity disturbance was modelled at true observation 
elevations, indicated by the uppermost grey line. c, SSSB thickness and 


Highland C and the Terre Adelie highlands peak well above sea level, 
and have average measured gradients on their frontal slopes of about 
+2.5mkm“!. Consequently, these highlands are persistent barriers to 
further ice-sheet retreat, even under highly unstable conditions*’’. 

Highland B, however, is pierced by several deep fjords, which indicate 
extended periods during which fast ice flow was channelled from a 
large upstream source in the ASB®. The fjords are also where ice-sheet 
retreat into the ASB is probably initiated. This additional vulnerabil- 
ity limits the residence time of an ice margin in front of Highland B 
as compared with Highland C**. Our observed erosion in region B1, 
which is less than the erosion in region C, suggests that retreat of the 
EAIS into the ASB has occurred as a characteristic of a fully retreated 
ice sheet. 

The erosion characteristics of regions B1 and C point to the activity 
of a retreated ice sheet. In our models, when dT, =+8°C and 
dT, = +2°C (Fig. 3b), the ice-sheet margin retreats to subregion B1, 
with high basal velocities mostly within subregion B1 and region C. 
When dT, = +12°C and dT, = +5 °C, further retreat to the B/C bound- 
ary is suggested (Fig. 3c). This retreat scenario is similar in extent to a 
previous interpretation® of the early ice sheet, before advance into the 
SSB, although the locations of the ice margins differ. With dT, =+15°C 
and dT, = +5 °C, full retreat into the ASB is indicated (Extended Data 
Fig. 7b). This retreat scenario is somewhat more extensive than a 
previous interpretation that considered the early ice sheet, before 
advance into the ASB®. 

Overall, the extent of SSB erosion points to long periods of time 
during which the SSB was subjected to a modern-scale ice sheet, with 
the margin located within region A (less than 150 kilometres retreat 
from present), and also large periods of time when it was subjected to 
a retreated ice sheet, with the margin located much further inland, near 
the B/C boundary (350 kilometres of retreat from present). Subregion 
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present-day surface-ice velocity!” (vertically flipped; average error in 
ice-sheet velocity on this line is 10.5 myr~'). A/B, B/C and C/HC 

indicate the locations of the interpreted erosion boundaries demarcated 
in Figs 1 and 3 (HC, Highland C). Dashed lines indicate the inferred 

prior 3-kilometre thickness of the SSSB, which has been almost entirely 
removed in region A, and eroded in line with present-day ice velocity in 
region B. See Extended Data Figs 3-6 for additional representative models 
throughout the SSB region. 


B1 is apparently less eroded than both regions A and C, suggesting 
that the residence time of the ice-sheet margin in this region has been 
less. This may indicate repeated but relatively short-lived transitions 
between the modern-scale and retreated states. 

Our interpretation is not time specific, but global climate and sea- 
level data suggest that the retreated ice-sheet was predominant in the 
Oligocene to mid-Miocene, with the modern-scale ice-sheet predom- 
inant since the mid-Miocene”’. Corresponding detrital provenance 
records at Prydz Bay’ and locally in sediment cores” suggest that 
basement rocks were being eroded from region A at 7 Ma and at 3.5 Ma 
(ref. 1). Therefore, the modern-scale ice-sheet has been a recurrent 
feature of Totten Glacier since well before 7 Ma. 

It has been suggested that a substantial Pliocene retreat here may 
have been necessary to generate the estimated high sea level of up to 
22 + 10 metres above present”, although a recent estimate is more 
moderate, at 9- 13.5 metres; ref. 25. Our data cannot directly constrain 
the extent of the Pliocene ice sheet at Totten Glacier, but we can esti- 
mate the contribution of this sector to global sea level for our repre- 
sentative ice-sheet models. 

Prior reconstructions of retreat from a modern-scale configuration 
under Pliocene conditions are highly variable*®. Models with a highly 
unstable ice sheet**®”” can retreat markedly, while more stable ice- 
sheet reconstructions typically fail to do so**®. Furthermore, different 
ice-sheet models can resolve similar ice-sheet extents with differing 
amounts of sea-level contribution*”®. There might also be short-term 
sea-level fluctuations that we do not consider here”’. 

The largest retreat possible under the modern-scale ice-sheet configu- 
ration (Fig. 3a) is associated with a total Antarctic sea-level contribution 
of 8.39 metres, of which the SSB/ASB sector provides 0.89 metres. 
Even allowing for 7.3 metres of sea-level increase from collapse of the 
Greenland ice sheet*, this is insufficient to explain 22 metres of total 
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a 110° E 120°E Figure 3 | Ice-sheet models with differing climate forcings. Ice-sheet 
Kilometres | surface and bed elevations and basal velocity contours are shown after a 
0 100 200 long-term 20,000-year run under constant climate forcing. Models were 


run with air temperatures (d7,) and ocean temperatures (dT,) above 
today’s. Contributions to global-mean sea level are estimated for all 
Antarctica (dV) and for the SSB/ASB sector (dVj; indicated by the green 
area in the inset). a, With dT, = +4°C and dT, = +1 °C, the ice-sheet 
margin is located near the A/B boundary and high basal velocities are 
focused in region B. b, With dT, = +8°C and dT, = +2 °C, the ice-sheet 
margin is located in region B and high basal velocities are focused in 
regions Bl and C. c, With dT, = +12°C and dT, = +5 °C, the ice-sheet 
margin is located near the B/C boundary and high basal velocities are 
focused in region C and the ASB. Further models are shown in Extended 
Data Fig. 7. m.s.Le., metres of sea-level equivalent. 
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sea-level rise~’, but it is sufficient to explain an increase of 13.5 metres 
(ref. 25). 

Our two models of the retreated ice sheet state (Fig. 3b, c) are asso- 
ciated with a total Antarctic sea-level contribution of 16.5 metres or 
21.5 metres, of which the SSB/ASB sector provides 2.18 metres and 
2.89 metres, respectively. Retreat of the ice sheet into the ASB (Extended 
Pignenies l Data Fig. 7b) is associated with a total Antarctic sea-level contribution 
aay ab of 29.1 metres, with 4.29 metres sourced from the SSB/ASB sector. 

The influence of Totten Glacier on past sea level is clearly notable, 
but for any particular warm period it is also highly uncertain, because 
the system is subject to progressive instability. Our results suggest that 
the first discriminant is the development of sufficient retreat to breach 
the A/B-boundary ridge. This causes an instability-driven transition 
from the modern-scale configuration to the retreated configuration. 
Under ongoing ice-sheet loss, the breaching of Highland B causes fur- 
ther retreat into the ASB. Each of these changes in state is associated 
with a substantial increase in both the absolute and the proportional 
contribution of this sector to global sea level. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data. Aerogeophysical data used include ice-surface, ice-thickness, magnetic and 
gravity data, all of which were collected through the ICECAP program during 
Antarctic summer seasons 2009/2010 through to 2012/2013. Instruments were 
flown aboard the DC-3T aircraft, civil registered as C-GJKB, owned and operated 
by Kenn Borek Air. Line data for these ICECAP products are available at full res- 
olution through the ICEBRIDGE data portal at the National Snow and Ice Data 
Center (http://nsidc.org/icebridge/portal/). 

The ice-surface elevation was obtained using a Rieg] LD90-3800-HiP-LR laser 
ranging system, combined with a range of GPS receivers and inertial measurement 
unit (IMU) systems. Spatial resolution is 25m along track and 1 m across track, 
with a net error of approximately 12 cm (ref. 31). Ice thickness was determined with 
the 60 MHz HiCARS ice-penetrating radar system*”. Crossovers in ice thickness 
yield an average difference of approximately 33 m (ref. 8); crossover difference 
scales as a function of basal roughness up to a few hundred metres. This error is 
due predominantly to uncertainty in the horizontal location of the reflection point, 
which occurs within a radius of about 500 m. 

Magnetic intensity data were measured with a Geometrics G-823A caesium 
vapour magnetometer housed in an aircraft tailboom*’. Raw total magnetic 
intensity data are reduced to magnetic intensity anomalies by correcting for the 
International Geomagnetic Reference Field, time variations in magnetic field, and 
spline-based line levelling’”. 

For the first three seasons, gravitational acceleration data were obtained with a 
Bell Aerospace BGM-3, two-axis-stabilized scalar gravity meter**. Gravitational 
acceleration was sampled at 1 Hz. Aircraft vertical and horizontal accelerations 
were estimated using dual carrier phase GPS solutions, and removed from the 
observed accelerations before filtering. A symmetrical finite impulse response 
low-pass filter with half-amplitude frequency point of 0.0054 Hz (185s) was used 
to smooth the resulting data. In the 2012/2013 season, the BGM-3 gravity meter 
was replaced with a three-axis-stabilized Canadian Microgravity GT-1A system”. 
Gravitational acceleration was sampled at an effective sampling rate of 18.75 Hz, 
and filtered with a filter length of 150s (0.006667 Hz). Typical aircraft velocity was 
90ms ‘, so this results in a minimum resolvable half-wavelength of approximately 
8km for BGM-3 data and 6.75 km for GT-1A data. 

The gravitational acceleration data were automatically edited on the basis of 

horizontal and vertical acceleration thresholds, and corrected for latitude, instru- 
ment drift, elevation, and the Eotvés effect. The final result is the disturbance from 
the global gravity field, or gravity disturbance. This is equivalent to the free-air 
anomaly. 
Gravity modelling approach and model uncertainty. For gravity modelling, 
we used the gravity disturbance data, with vertical gravitational accelerations 
calculated at observation elevation. For modelling along flight lines, we used an 
iterative combination of forward modelling and inversion, using the commercial 
two-dimensional code GM-sys which is based upon the principles of ref. 36. Full 
three-dimensional modelling is not appropriate for our survey design, owing to the 
sparse and irregularly spaced data and variably oriented flight lines. 

An initial model, described below, was perturbed using an iterative combination 
of inversion and manual forward modelling. The boundaries of the sedimentary 
basin base were inverted for depth using the inbuilt GM-sys inversion module, 
which uses a damped least-squares optimization*’ to estimate the thickness of the 
basin that minimizes gravity misfit. Inversion damping is not user specified, and 
results can be erratic. Therefore, in between inversion iterations, we carried out 
manual editing involving smoothing of anomalously steep or erratic results, and 
corrections so as to better match the ice-sheet-bed morphology where the thickness 
is close to zero. Most results were achieved within two or three iterations. 

This approach can successfully be used to model one-dimensional and two- 
dimensional structures, but not complex three-dimensional structures. Our 
approach is valid for the ice-sheet surface and the SSSB base, which are very close 
to one-dimensional, but some topographic features are three dimensional in form. 
In addition, we only resolve gravity features with widths of more than 6.75-8 km, 
whereas the ice-sheet bed is resolved at a much finer detail. Our models repre- 
sent this topography with a data spacing of roughly 2km. Combining the above 
leads to inaccurate basin-thickness models near rugged topography—such regions 
including the edges of Highland A, fjords, and deep channels. These regions are 
also most likely to possess large errors in the subglacial topography data set. In 
elevation terms, these errors are on average an order of magnitude less than our 
gravity modelling error, but in gravity terms they might be important owing to 
the high density contrast at this interface (~1,500kgm * versus ~300kgm_°). 

We constructed an initial model consisting of five layers: ice sheet, sedimentary 
basins, upper crust, lower crust and mantle. These layers exist, in order, across 
the entire model and cannot overlap, although they can become zero thickness. 
Boundaries for the ice-sheet surface and bed elevations were derived directly from 
the ICECAP line data, and for the ice sheet we used a density of 9220kgm_°. Density 


variations from the ice-sheet surface and bed are the most notable component of 
the gravity disturbance data (Fig. 2 and Extended Data Figs 3-6). The Moho was 
defined by a flexural model that accounts for ice and topographic loads. An elastic 
thickness of 25 km was used, although—because of the long-wavelength of the 
loads involved—little variation in Moho structure is observed between 10 km and 
50km elastic thickness. The lower crust had a density of 2,800 kg m~°, the mantle 
a density of 3,200 kgm _~*. Density variations from these interfaces largely balance 
out the long-wavelength gravity effects of ice-thickness and topography variations 
(Fig. 2 and Extended Data Figs 3-6). None of these layer boundaries was a variable 
in the gravity model. 

Our target interface for modelling is the base of the sedimentary basins. We 
precondition the model using previously published depth-to-magnetic-source 
estimates from the ICECAP data set!*. These source points were derived from 
multiple passes of Werner deconvolution’® using the levelled magnetic line data. 
As an initial surface, we infer the base of the sedimentary basin at the uppermost 
resolved magnetic source, under the condition that the slope of the basin base 
was not excessive. Where the bed is typically deep, we ignored responses at the 
ice-sheet bed. Substantial scatter is evident in the depth-to-source estimates, but 
nonetheless they define the overall geometries of the SSSB and the sedimentary 
basins in the ASB and VSB’. Except for initializing the geometry of this surface, 
we did not use the depth-to-magnetic-source estimates to constrain the geometry 
of the returned model surface. 

We assumed, in the first instance, a crystalline basement density of 2,670kgm ~~. 
Sedimentary rock density was varied between a lower limit of 2,200kgm~° and 
an upper limit of 2,500 kgm ~°, providing an effective density contrast range of 
300kgm_°. The lower limit was based on a reasonable maximum porosity of 
25%, given the thickness of ice and the likely age of the rocks; the upper limit was 
defined by the point at which basin thickness departs considerably from magnetic 
depth-to-basement estimates’°. These sedimentary densities provide the largest 
uncertainty regarding basin thickness and create the ranges of sediment thickness 
shown in Fig. 2c and Extended Data Figs 3c, 4c, 5c and 6c. In the deeper parts of 
the SSB, the ASB and VSB, this uncertainty can approach several kilometres. Along- 
line analyses for our selected profiles give a length-weighted mean uncertainty of 
+1,239 m, but the length-weighted mean value for the SSB region is +609 m. 
Although large, the error is a systematic uncertainty that varies linearly and predict- 
ably with basin thickness, and has no influence on the pattern of basin thickness, 
only the magnitude. Consequently, our interpreted erosion boundaries are not 
especially sensitive to this uncertainty, but estimates of the amount of erosion are. 

A limitation of the gravity method is that only relative gravity differences are 
modelled, and results for each line are not directly comparable. To correct for 
this, we levelled the results to a common baseline. We reduced cross-line differ- 
ences by applying a constant value to a tensioned spline fit through the basin base 
on each line, so as to minimize the misfit between that line and all cross-ties. 
These cross-ties (Extended Data Fig. 8) have a mean cross-tie difference—that is, 
the total difference in thickness—of 607 m (or 4303.5 m), and a median cross-tie 
difference of 417 m (or +208.5 m). The highest values are associated with rugged 
topography, certain flight lines, and the deepest basin regions. Some lines traverse 
long sections without cross-ties, in particular at the ends of the lines. It is hard to 
numerically assess the effect of the lack of tie-points; however, the isolated inter- 
sections at the south of the image do not show atypical error, although they are 
above average (Extended Data Fig. 8). The central SSB provides consistently low 
cross-line differences (Extended Data Fig. 8). This error encapsulates most of the 
non-systematic errors in the gravity modelling, including those associated with 
gravity data accuracy, inversion instabilities, three-dimensional gravity effects, 
topographic errors, and undersampling of small features. The cross-tie errors sug- 
gest that, for a particular density contrast, the sedimentary basin thickness can 
usually be defined to within +300 m. 

A final consideration is the potential influence of lateral variations in density 
within the region. We note that the gravity data contain the summed gravity field 
of all sources. Thus we have the fundamental ambiguity that, for any given lateral 
density distribution, a basin thickness could be found to satisfy the gravity field, 
and vice versa. In the absence of firm information on these lateral variations, we 
have adopted a simple approach to modelling the thickness of sedimentary rocks. 
Nevertheless, lateral variations in density are undoubtedly present. Lateral varia- 
tions in sedimentary rock density will occur within the previously defined bounds 
unless they are highly unusual. Basement density variations may also be important, 
and can be classed either as intra-block or inter-block variability. 

Intra-block variability is hard to predict and to test, and cannot be excluded 
here, but typically is not a major influence on the large-scale gravity field unless 
large mafic intrusions exist. The error from internal block variability may be com- 
parable in magnitude, but not in pattern, to the variability in sedimentary basin 
density. For example, a large, 5-km-thick tabular body with a density contrast of 
50kgm~? is equivalent to a +500_m uncertainty in basin thickness with density 
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contrast of 250kgm~°. In the case of the SSSB, the observed intra-block gravity 
variations show correlation with the subglacial topography, which includes fairly 
well defined glacial landscapes, and, for regions A and B, correlates also with the 
present ice-sheet velocity. An igneous or metamorphic explanation demands an 
unlikely coincidence, and so we consider it unlikely that basement density varia- 
tions are the main source of intra-block gravity variability. 

Inter-block density variations are potentially more important. Owing to the 
size and depth extent of crustal blocks, even small variations between them can be 
substantial. Extended Data Fig. 1 shows that the SSB region possesses a large gravity 
disturbance high, whereas the Aurora and Vincennes Subglacial Basin regions 
possess large gravity disturbance lows. The Knox Coast and Law Dome have rela- 
tively high gravity, and the Terre Adelie highlands are mixed in response. Tectonic 
interpretations based predominantly on magnetic data, but also on gravity data’, 
indicate that in this region there are several major tectonic structures—including 
the Totten Shear Zone, along Totten Glacier; the Aurora Fault and the Indo-Australo- 
Antarctic Suture, which bound the ASB to the northwest and southeast respectively; 
and the Frost Fault, which bounds the Terre Adelie highlands. Several unnamed 
faults subdivide the Mawson Craton!?. 

A recent model of Australia’s crust*’ returns vertically averaged crustal density 
anomalies for the comparable region that range from +0.2% to +3.8%, with a 
mean of +2.0% and a standard deviation (c) of 0.58%. Applying those 20 limits 
with a base density value of 2,670kgm ~*, the overall basement density range is 
approximately 60kgm >. We test the impact of a comparable density range on our 
basin models by applying reasonable basement density contrasts of +25kgm~°. 
Models along flight lines RO3Ea, RO6Ea and RO8Eb allow comparison with the 
unperturbed models (Fig. 2 and Extended Data Figs 4, 5). The Y07b and GL0092a 
lines did not cross sufficient major faults to be included here. 

Within these models, each major crustal block is assigned a density of 
2,670kgm_°, 2,645kgm ° or 2,695kgm_*. To test our models’ robustness, we 
seek the density configuration within these bounds that leads to the flattest basin 
morphology (Extended Data Fig. 9). Models were inverted once, with no subse- 
quent smoothing, so the results are slightly more rugged than the initial geometry 
(Extended Data Fig. 9). Automated adjustment of data shift was applied, and so 
even blocks without changes to density contrast can have changed basin geometries. 

Under this testing, substantial changes are made to the VSB and Terre Adelie 
regions, and to the thickness variation between these regions and the SSB. For the 
VSB and ASB regions, and for the Terre Adelie region, it is likely that inter-block 
basement density variations contribute to the gravity anomaly, although it is not 
known to what extent. This, combined with the large thickness of the basins, raises 
uncertainty as to the basin geometry. 

For the SSB, which is now underlain by a dense crustal block, substantial 

changes to thickness are observed, but the pattern of basin thickness remains 
similar (Extended Data Fig. 9). In particular, the inland slope within regions A 
and B and the contrast between regions B1 and C are persistent features. The 
A/B boundary shows no correlation with any major tectonic boundary. On line 
RO8EDb, and to a lesser degree on line RO6Ea, the B/C boundary is correlated with 
a notable basement fault. Such a correlation is consistent with the overdeepening 
of a fault-related topographic depression during glacial activity, and also with the 
subsequent influence of fjord formation on EAIS erosion. This link with basement 
structure does not necessarily imply that the observed thickness variations are 
tectonic in origin. The same fault does not correlate to a significant thickness 
variation on line RO3Ea. 
Ice-sheet modelling. We follow an established methodology in our experiments”, 
and adopt the same present-day input climatology and tuning parameters (stress 
balance and basal resistance prescription). The following aspects of our new 
experiments differ, however. Our experiments start from a thermally equilibriated 
ice-sheet geometry and run for 20,000 years (20 kyr), so that the modelled ice-sheet is 
at (or close to) equilibrium. Climate perturbations are applied as linear changes from 
present-day conditions, between 2,000 and 3,000 model years. The first 2,000 years 
allow any transient behaviour associated with model initialization to take place in 
the absence of environmental perturbations, whereas the subsequent 1,000 years 
force the ice sheet to evolve slowly to changes in air and ocean temperatures and 
precipitation. The remaining 17 kyr allow a steady state to be reached. All experi- 
ments are run at a spatial resolution of 20 km. 

As previously’, we use a sub-grid-scale basal-traction and driving-stress inter- 
polation scheme to allow realistic grounding-line motion*®. However, we choose 
not to implement the sub-grid-scale interpolated ice-shelf basal melt component 
of this scheme, owing to uncertainties as to its validity and the possible resolution 
dependence of model results”**'. Calving is parameterized using horizontal strain 
rates and a minimum thickness criterion”. 

Surface mass balance depends on monthly climatological data and a positive 
degree-day model that tracks snow thickness and allows for melting of snow 
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and ice at3mm°C~' day! and 8mm °C! day“! respectively. We incorporate a 
white-noise signal (normally distributed; mean random temperature increment 
of zero) into the calculation of daily temperature variations. The standard devi- 
ation of daily temperature variability is set at 2°C, somewhat lower than the 
commonly used value of 5°C, on the basis that the latter has a tendency to over- 
estimate melt****. Surface temperatures are adjusted for elevation according to an 
altitudinal lapse rate of -8 Kkm“, and a refreezing coefficient of 0.6 is used to 
mimic meltwater capture within the snowpack. 

The sea-level contribution for the whole of Antarctica is derived directly from 

the numerical ice-sheet model”. For the Aurora and Sabrina basins sector, we 
derive the relative sea-level contribution within the region defined by the longi- 
tudinal limits of 102.5° E and 127.5°E, and north of 80°S. These limits encom- 
pass the Totten Glacier catchment, with minimal effects from including parts of 
adjacent regions. We extend the region north into the ocean 60°S so as to capture 
any isostatic bathymetric effects offshore. For each model result, we compute 
the remaining mass-loss potential, defined by the difference between the mass of 
the ice sheet, and the mass of water required to inundate regions below sea level. 
The difference in mass-loss potential between models returns the mass that has 
been lost or gained. This mass difference was converted to an equivalent seawater 
volume, and divided by the global ocean area (3.6 x 10'4 m’) to estimate global 
sea-level change. For consistency in resolution, we use the model with present-day 
climate forcing as the baseline, rather than the present-day ice sheet. 
Code availability. All geophysical processing and modelling used commercially 
available software, including ArcGIS from ESRI and Oasis Montaj from Geosoft. 
Ice-sheet modelling used the PISM code which is available from https://github. 
com/pism/pism. 
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Extended Data Figure 1 | ICECAP geophysical data. a, Gravity disturbance data. b, Magnetic intensity data. Dashed red lines indicate interpreted 
basement faults’*. Solid, dashed and dotted blue lines indicate the erosion interpretation as shown in Fig. 1. 
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Extended Data Figure 2 | SSSB base morphology. The map shows the (Fig. 2), RO8Eb (Extended Data Fig. 5) and GL0092a (Extended Data Fig. 6). 
relationship of the SSSB base morphology with basement faults (dashed Arrows indicate the right of the associated model figure in each case. The 
black lines), and the locations of the representative model profiles: flight interpreted erosion boundaries are also shown, in grey, with solid, dashed 
lines YO7b (Extended Data Fig. 3), RO3Ea (Extended Data Fig. 4), RO6Ea and dotted lines as in Fig. 1. 
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observed and calculated gravity disturbance, including model components _ (vertically flipped). Basin thickness is shown for a density contrast of 


from ice and topography, the deep crust and Moho (‘mantle and lower 


270kgm_~? (2,400 kg m~), and a thickness range for density contrast of 


crust’), and the sedimentary basins. b, The model, also showing estimates 170-470 kg m~? (2,200-2,500 kg m~*). Average error in ice-sheet velocity 


of depth to magnetic basement, the tensioned spline fit, and cross-ties 
with other models. Ice, sedimentary rock and basement densities are 


on this line is 8.4m yr7! (ref. 12). 
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Extended Data Figure 4 | Gravity model along flight line RO3Ea. 

a, The observed and calculated gravity disturbance, including model 
components from ice and topography, the deep crust and Moho (‘mantle 
and lower crust’), and the sedimentary basins. b, The model, also showing 
estimates of depth to magnetic basement, the tensioned spline fit, and 
cross-ties with other models. Ice, sedimentary rock and basement densities 


are as per Fig. 2. c, SSSB thickness and present-day surface ice velocity 
(vertically flipped). Basin thickness is shown for a density contrast of 
270kgm * (2,400kgm~*), and a thickness range for density contrast of 
170-470 kgm? (2,200-2,500 kg m~*). Average error in ice-sheet velocity 
on this line is 9.7 myr7' (ref. 12). TAH, Terre Adelie highlands. 
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Extended Data Figure 5 | Gravity model along flight line ROSEb. a, The per Fig. 2. c, SSSB thickness and current surface ice velocity (vertically 
observed and calculated gravity disturbance, including model components _ flipped). Basin thickness is shown for a density-contrast of 270kgm~° 


from ice and topography, the deep crust and Moho (‘mantle and lower (2,400 kg m_°), and the thickness range for density contrast of 170 to 
crust’), and the sedimentary basins. b, The model, also showing depth 470kgm_~* (2,200 to 2,500 kgm °). Average error in ice-sheet velocity on 
to magnetic basement estimates, the tensioned spline fit, and cross-ties this line is 8.6 myr™' (ref. 12). HC, Highland C. 


with other models. Ice, sedimentary rock and basement densities are as 
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Extended Data Figure 6 | Gravity model along flight line GL0092a. 
a, The observed and calculated gravity disturbance, including model 
components from ice and topography, the deep crust and Moho (‘mantle 
and lower crust’), and the sedimentary basins. b, The model, also showing 
estimates of depth to magnetic basement, the tensioned spline fit, and 
cross-ties with other models. Ice, sedimentary rock and basement densities 
are as per Fig. 2. c, SSSB thickness and present-day surface ice velocity 
(vertically flipped). Basin thickness is shown for a density contrast of 
270kgm ° (2,400kgm_°), and a thickness range for density contrast 
of 170-470 kg m~? (2,200-2,500 kg m~°). Average error in ice-sheet 
velocity on this line is 10.6 m yr! (ref. 12). HB, Highland B; ASB, Aurora 
Subglacial Basin. 
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Extended Data Figure 7 | Additional ice-sheet models. Ice-sheet surface also have high basal velocity. b, With dT, = +15 °C and dT, = +5 °C, the 


and bed elevations and basal velocity contours are shown after a ice-sheet margin has retreated deep into the ASB and is preserved mainly 
long-term, 20-kyr run under constant climate forcing. a, With air and on inland highlands. Sea-level contributions are defined for all Antarctica 
ocean temperatures identical to today’s (dT, =0°C and dT, = 0°C), the (dV) and for the ASB/SSB sector (dV), denoted by the green region in 
ice-sheet margin is located near its present location, and high basal the inset. 


velocities are focused in region A. This model includes ice shelves, which 
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Extended Data Figure 8 | Model intersection misfits for the gravity 
models. The dots show the value in metres of each tensioned spline mis-tie 
after model levelling was applied. Levelling applied a constant value to 

the base of the sedimentary basin on each line to achieve an optimal 
minimum-norm fit to all cross-ties. Rugged subglacial topography 
(background) is the main source of non-systematic model errors because 


of three-dimensional effects. 
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figures. Fault locations are derived from the interpretation shown 
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Extended Data Table 1 | Zonal characteristics of topography and sedimentary rock thickness 


; Bed roughness and 
Subglacial topography Sedimentary rock thickness basal reflectivity” 


SSB Deep troughs, reaching -2400m elevation | Zero to ~ 1 km, Moderate to high bed 
regionA | associated with major glaciers (Totten, Thinnest at the head of Totten roughness 

Hinterland region of generally elevated Thickens towards South-east 

(ca. -200m) terrain, with a hilly to Thick beneath floating ied 


lineated texture, cut by several channels 
shelves 
with up to 600m relief. 


These channels are aligned with present- 
day ice flux. 


SSB Lower elevation region (ca. -500 m) cut Ca. 2-3 km in centre of the zone. | Moderate to low bed 
region B1 | by two ~40 km wide major channels. The | Less where channels present, roughness 
main region has a smooth texture. ca. 1-1.5 km, and significantly Low basal reflectivity 
Western channel leads to the ASB via thinner at zones of confluence 
Highland B, and deepens inland from - (<500m) 


800m to -1000m in elevation. 


Eastern channel at ca. -800m elevation 
leads to region C. This channel possesses 
several branches leading to the east 


channels. Moderate basal 
reflectivity 


SSB Moderate to very low elevation (0 to - Typically <500 m. with patchy Moderate to high bed 
region C 1200 m). Strongly sloping towards the thicker areas (1-2 km) between | roughness 
centre of region C. channels. Low basal reflectivity 
Several branches lead up to Highland C One large 3km thick area 


SSB More elevated regions (-400 m to Om), Ca 1.5-2 km in northwest, 2.5 -3. | Moderate to high bed 
region B2 | sloping towards the centre of the SSB. km in southeast roughness, especially 
Moderately hilly texture with no major >4 km beneath Highland B at short wavelengths 


and Terre Adelie highlands, with an located between channels at 
overall dendritic pattern. boundary with region B1. 


ASB Very low elevation (-1000 to -1500 m). Very thick (> 5 km) except in Very low bed 
Smooth texture. north (2-3 km), and generally roughness 
Central ridge (-700 to 0 m) separates two _| thickens to the south High basal reflectivity 
deep channels. 


Low elevation (-400 to -800 m), fairly Thick (3-6 km) and generally Low to very low bed 
smooth texture. No major channels. thickens to the southwest roughness 


High basal reflectivity 
Highland | Elevated (>-100m elevation) and typically | Low or zero thickness on Ridge _| Typically high bed 
Regions rough textures. Cut by channels and B, Dome C, Knox Coast. roughness 


fjords near ASB. Moderate to thick thickness on | Typically low basal 
Highlands A, B, C, Law Dome reflectivity 
and Terre Adelie Highlands 


The listed variations in the characteristics of subglacial topography and the preserved thickness of sedimentary rocks were used to define the zones identified in Fig. 1. The table also documents basal 
roughness and basal reflectivity characteristics from prior studies!%, although these were not used to guide our interpretation. 
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Metabolic acceleration and the evolution of human 


brain size and life history 


Herman Pontzer!*, Mary H. Brown’, David A. Raichlen*, Holly Dunsworth°, Brian Hare°, Kara Walker®, Amy Luke’, 
Lara R. Dugas’, Ramon Durazo-Arvizu’, Dale Schoeller®, Jacob Plange-Rhule’, Pascal Bovet!®"'", Terrence E. Forrester’, 
Estelle V. Lambert'’, Melissa Emery Thompson", Robert W. Shumaker!>»!°-” & Stephen R. Ross? 


Humans are distinguished from the other living apes in having 
larger brains and an unusual life history that combines high 
reproductive output with slow childhood growth and exceptional 
longevity’. This suite of derived traits suggests major changes 
in energy expenditure and allocation in the human lineage, but 
direct measures of human and ape metabolism are needed to 
compare evolved energy strategies among hominoids. Here we used 
doubly labelled water measurements of total energy expenditure 
(TEE; kcal day~') in humans, chimpanzees, bonobos, gorillas 
and orangutans to test the hypothesis that the human lineage 
has experienced an acceleration in metabolic rate, providing 
energy for larger brains and faster reproduction without 
sacrificing maintenance and longevity. In multivariate regressions 
including body size and physical activity, human TEE exceeded 
that of chimpanzees and bonobos, gorillas and orangutans by 
approximately 400, 635 and 820 kcal day '" respectively, readily 
accommodating the cost of humans’ greater brain size and 
reproductive output. Much of the increase in TEE is attributable 
to humans’ greater basal metabolic rate (kcal day~'), indicating 
increased organ metabolic activity. Humans also had the greatest 
body fat percentage. An increased metabolic rate, along with 
changes in energy allocation, was crucial in the evolution of human 
brain size and life history. 

Variation in life history reflects evolved differences in energy 
expenditure. Each organism must allocate its available metabolic 
energy, which is largely a function of body size, to the competing needs 
of growth, reproduction and maintenance, resulting in fundamental 
trade-offs among these vital tasks”°. For example, species that repro- 
duce faster than expected for their body mass generally have shorter 
maximum lifespans, as energy is directed towards reproductive out- 
put and away from maintenance” °. Among primates, this trade-off 
framework has been expanded to consider the energy needed to grow 
and maintain large brains®’. 

In this light, humans present an energetic paradox. Humans in natu- 
ral fertility populations reproduce more often, and produce larger neo- 
nates, than any other living hominoid, yet humans also have the longest 
lifespans and the largest, most metabolically costly brains’ (Extended 
Data Fig. 1). This uniquely human suite of derived, metabolically costly 
traits suggests a lifting of energetic constraints in the hominin line- 
age, but the critical underlying mechanisms remain largely unknown. 
Some have hypothesized that a reduced gut® or increased locomotor 
efficiency’ provided the extra energy needed for brain expansion. 


However, phylogenetically informed analyses suggest that gut reduction 
is insufficient to explain increased human brain size’, and, although 
human walking is more economical’®, traditional hunter—gatherers 
travel so much farther per day"! that their daily ranging costs are 
no lower than wild chimpanzees’ (Supplementary Discussion and 
Supplementary Table 1). Similarly, provisioning of young offspring 
and their mothers helps to shorten human inter-birth intervals and 
increase the pace of reproduction’, but reproduction remains relatively 
costly for human mothers (Supplementary Discussion, Supplementary 
Fig. 1 and Supplementary Table 2). Furthermore, it is unclear whether 
food sharing or other dietary changes are sufficient to fuel larger brains, 
larger neonates and longer lifespans without an acceleration in meta- 
bolic rate to harness increased energy acquisition. 

Here, we test the hypothesis that humans have evolved an acceler- 
ated metabolic rate and larger energy budget, accommodating larger 
brains, greater reproductive output and longer lifespans without the 
expected energetic trade-offs. Increased TEE (kcal day~*) has pre- 
viously been discounted as an explanation for the human energetic 
paradox’, in part because human basal metabolic rate (BMR; 
kcal day~') is broadly similar to that of other primates®*'’. However, 
variation in TEE and BMR among humans and the great apes, our 
closest evolutionary relatives, is largely unstudied. Lacking sufficient 
data on TEE and BMR in apes, previous analyses have been unable to 
compare metabolic rates across the entire hominoid clade and test for 
metabolic acceleration in humans. 

We used the doubly labelled water method” to measure TEE in 
mixed-sex samples of adult chimpanzees (Pan troglodytes; n= 27), 
bonobos (Pan paniscus; n= 8), Western lowland gorillas (Gorilla 
gorilla; n= 10) and orangutans (Pongo spp.; n= 11), and compared 
these data to similar measures of TEE ina large, adult human sample!? 
(Homo sapiens; n= 141); this method also provides a measure of body 
composition (Methods and Table 1). TEE was measured over 7-10 days 
while individuals followed their normal daily routine. We also com- 
pared published measurements of BMR in humans, chimpanzees and 
orangutans, and we estimated daily locomotor energy expenditure 
and BMR for adults in our TEE sample to assess their contribution to 
variation in TEE. Comparisons were performed at the genus level, both 
to avoid the issue of close phylogenetic relatedness for chimpanzees 
and bonobos, and because no metabolic differences were apparent 
between these two species. 

Humans exhibited greater TEE than other hominoids, with 
larger daily energy budgets than all apes except adult male gorillas 
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Table 1 | Age, body size and composition, and TEE 


LETTER 


Genus Sex n Age (years) (s.d.) Mass (kg) (s.d.) = FFM (kg)(s.d.) — Body fat* (%) (s.d.) TEE (kcal day~!) (s.d.) BMR#* Walk and climb+ 
Homo Female 76 33.5 (6.0) 80.1 (19.8) 46.1 (7.4) 41.1(7.1) 2,190 (381) 1,474 183 

Male 65 32.6 (5.4) 73.8 (15.2) 56.1 (7.9) 22.9 (7.6) 2,721 (569) 1,640 226 
Pan Female 17 25.8 (11.3) 46.4 (8.1) 43.3 (7.8) 9.0 (5.5) 1,722 (363) 1,214 102 

Male 18 22.3 (10.7) 57.9 (13.4) 53.8 (11.7) 8.4 (4.9) 2,145 (546) 1,401 120 
Gorilla. Female 6 19.7 (4.1) 73.7 (7.3) 63.3 (7.0) 13.9 (6.4) 2,030 (527) : 110 

Male 4 24.5 (11.7) 166.4 (42.5) 148.3 (27.0) 15.2 (4.6) 3,630 (339) : 253 
Pongo Female 5 27.6 (5.3) 58.2 (4.0) 44.3 (2.5) 23.4 (8.8) 1,476 (268) 984 88 

Male 6 20.0 (10.8) 76.7 (34.1) 62.4 (23.3) 16.4 (8.6) 1,617 (289) 1,090 163 


s.d., standard deviation. 
*Excludes n= 6 Pan subjects with negative calculated body fats (Methods). 
tCalculated for Pan, Gorilla, and Pongo using equation 7.17 in ref. 12. 


+Estimated BMR and locomotion costs were calculated from body mass and daily activity (Methods). 


(Table 1). Fat-free mass (FFM) explained 34% of the variance in TEE 
(t(195) = 10.18, 7? =0.34, P< 0.001; Fig. 1 and Supplementary Table 3). 
Differences among genera accounted for an additional 27% of the 
variance in TEE, with humans having the highest TEE, followed in 
order by Pan, Gorilla and Pongo (P< 0.001 for all comparisons; Fig. 1 
and Supplementary Table 3). For comparison, mean FFM was similar 
for Homo, Pan and Pongo cohorts, yet mean human TEE was ~27% 
greater than Pan and ~58% greater than Pongo. Body fat was 
weakly, negatively correlated with TEE (G= —0.04+ 0.02, P=0.02); 
age (P=0.70) and sex (P= 0.26) were not significant covariates in TEE 
models including FFM, fat mass and genus (Supplementary Table 3). 
Results were robust across different models for calculating TEE from 
isotope enrichment (Supplementary Table 3). 

Together, BMR and daily locomotor expenditure accounted for 
much of the variation in TEE among genera. The mean estimated 
daily energy cost of walking and climbing among apes in our sample 
was similar across genera and ~100 kcal day * below estimated daily 
walking cost in our human sample (Table 1). Our analyses of pub- 
lished measurements'*~!° indicate that BMR is lower in Pan than in 
comparably aged humans, in analyses controlling for sex, age and body 
mass (P< 0.001; Methods and Extended Data Fig. 2). The few meas- 
urements of Pongo BMR!“'’ appear to be lower still (Extended Data 
Fig. 2); no data for the Gorilla BMR are available. For cohorts in the 
TEE data set, estimated BMR for Homo was ~200 kcal day! greater 
than Pan and ~500kcal day“! greater than Pongo (Table 1). Estimated 
physical activity levels (PALs: TEE/estimated BMR; see Table 1), were 
similar across genera (Homo: female 1.5, male 1.7; Pan: female 1.4, 
male 1.5; Pongo: female 1.5, male 1.5), indicating that organ activity 
contributed more than physical activity to variations in TEE. 

Body fat percentage, measured from isotope dilution'*, was mark- 
edly higher in humans than other hominoids in our TEE sample, and 
only humans exhibited a significant sex difference (Table 1). Body fat 
percentage was remarkably low for Pan, consistent with measures from 
dissection'’, and somewhat higher for Gorilla and Pongo. Indeed, body 
fat percentages for captive apes were comparable to, or even below, 
average body fat percentages for humans in physically active, tradi- 
tional hunter-gatherer populations’. 

Together, these results indicate substantial metabolic evolution 
across the hominoid clade and a previously uncharacterized diver- 
sity in TEE, BMR and body fat percentage (Fig. 2). For a given FFM, 
humans have greater TEE, BMR and fat mass than other hominoids. 
Chimpanzees and bonobos have the next largest energy budgets but 
carry the least amount of body fat. Gorillas have lower TEE and greater 
body fat than Pan. Orangutans have exceptionally low TEE and BMR, 
and fat percentages similar to gorillas (Table 1). 

Comparisons of BMR and TEE indicate that hominin brain and life 
history evolution were fuelled in large part by increased mass-specific 
organ metabolic rates (kcal g~'h~'), and not solely through anatom- 
ical or physiological trade-offs. BMR reflects the summed metabolic 
activity of the organs at rest and is largely shaped by organs with high 
mass-specific metabolic rates such as the brain, liver and gastrointestinal 
tract”°. If, as assumed in previous studies, organ-mass-specific meta- 
bolic rates were similar across hominoids, then humans and apes with 


the same FFM should have similar BMRs — the cost of humans’ larger 
brain offset by a reduced gastrointestinal tract®. Instead, BMR and TEE 
are substantially greater for humans, controlling for FFM (Table 1, 
Figs 1, 2 and Extended Data Fig. 2), indicating an evolutionary increase 
in the metabolic activity of at least some organ systems. Humans’ 
greater BMR and TEE could readily accommodate the marginal 
cost of increased reproductive output, estimated at ~130 kcal day“! 
over the course of a woman's reproductive career in natural fertility 
populations (Supplementary Discussion, Supplementary Table 2 and 
Supplementary Fig. 3). Greater metabolic activity could also support 
humans derived brain metabolism?’ and increased somatic mainte- 
nance and longevity’, but more comparative analyses are needed to 
identify specific mechanisms and associated metabolic costs. 
Analysing relatively sedentary human and ape populations allowed 
us to compare cohorts with similar FFMs, PALs and locomotor 
expenditures (Table 1), but the pattern of TEE, BMR and body com- 
position variation among genera is expected to hold across more 
physically active populations. When controlling for body size, TEE 
is remarkably consistent across populations within a species, regard- 
less of activity level**: traditional hunter-gatherers have similar size- 
corrected TEEs to people living in cities'’, and zoo-housed popula- 
tions of primates and other mammals are similar to their counterparts 
in the wild” **. Humans in traditional foraging and farming popula- 
tions and apes in the wild will generally have lower body fat percent- 
ages’? than the cohorts in the TEE sample (Table 1), but again the 
pattern of differences seen here is expected to persist: even in captivity, 


3.5 4.0 4.5 5,0 
In FFM (kg) 


Figure 1 | TTE and FFM for hominoids. Humans (Homo, grey, n= 141), 
chimpanzees and bonobos (Pan, blue, n = 35), gorillas (Gorilla, green, 

n= 10), and orangutans (Pongo, orange, n= 11). Lines and shaded regions 
indicate least squares regressions and 95% confidence intervals for each 
genus. TEE for Homo exceeds other genera in a general linear model 
accounting for FFM, fat mass and other variables (P < 0.001; Table 1 and 
Supplementary Table 3). 
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Figure 2 | Predicted TEE, BMR and fat mass for adult hominoids. Values 
are estimated for males (55 kg FFM) and females (45 kg FFM), using the 
same FFM across genera. TEE (red bars) is estimated from FFM, fat mass 
and genus using model C in Supplementary Table 3; error bars represent 
model standard error. BMR (darker red regions) is estimated from body 
mass (Methods and Extended Data Fig. 2); no BMR data are available 

for Gorilla. Fat mass (yellow bars) is calculated from FFM using body fat 
percentages in Table 1. 


body fat percentage in Pan is remarkably low and fat percentages for 
Pongo and Gorilla fall near the very lowest ranges for healthy, physi- 
cally active human populations’. 

Expansion of the hominin energy budget challenges current models 
of life history and brain evolution predicated on metabolic stasis and 
constraint!*~°. Evolved increases in BMR and TEE do not diminish 
the importance of gut reduction, efficient walking, dietary change and 
maternal provisioning in the hominin lineage, but rather place these 
critical adaptations within a different energetic framework. Increased 
TEE exposes individuals to a greater likelihood of energy shortfalls, 
providing strong selection for behavioural and anatomical adapta- 
tions to mitigate this risk. Gut reduction® allows for greater energy 
allocation to reproduction and maintenance (including larger brains) 
while limiting the increase in BMR and TEE. Improved walking 
efficiency’®, a dietary shift to more energy-dense foods (for example, 
meat, tubers), and the adoption of cooking” all effectively increase the 
net energy gained from foraging, and may have had an essential role 
in the evolutionary expansion of the hominin energy budget. Food 
sharing’ provides an additional source of food energy, and along with 
increased body fat’, provides an important buffer against energy short- 
falls, particularly for children and for females shouldering the added 
energetic burdens of reproduction. We hypothesize that food sharing 
and increased body fat coevolved with greater TEE to mitigate the 
inherent risk of increased energy demands. 

Metabolic measurements of living hominoids resolve the human 
energetic paradox and may also point towards strategies for combating 
obesity and metabolic disease. Humans exhibit an evolved predispo- 
sition to deposit fat whereas other hominoids remain relatively lean, 
even in captivity where activity levels are modest. Untangling the evo- 
lutionary pressures and physiological mechanisms shaping the diver- 
sity of metabolic strategies among living hominoids may aid efforts 
to promote and repair metabolic health for humans in industrialized 
populations and apes in captivity. 

Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample. Adult apes (age > 10 years) were housed in Association of Zoos and 
Aquariums accredited zoos in the United States, or in sanctuaries in the Democratic 
Republic of Congo, with large enclosures. No statistical methods were used to pre- 
determine sample size. Institutional approvals were obtained from Hunter College 
and participating facilities before data collection. Human data were drawn from a 
recent large global study!’ (United States, Ghana, South Africa, Seychelles, Jamaica; 
institutional approvals from local governments and Loyola University, and each 
subject’s informed consent, were obtained before data collection). Men and women 
employed in manual labour were excluded from analyses to better match activity 
levels across species. Apes were excluded from analyses if they had a dilution space 
ratio less than 1.00, or had a calculated body fat percentage below — 10%, either 
of which could signal an error in dosing or sample processing. Owing to normal 
measurement error, the isotope dilution method for calculating body composition 
can produce marginally negative body fat estimates in subjects with fat percentages 
near 0. This occurred in n= 6 Pan subjects (range: —1 to —6% body fat), which is 
unsurprising given the low body fat percentage in Pan'® (Table 1). These subjects 
were included in analyses unless otherwise noted; they were excluded from anal- 
yses including fat mass (Supplementary Table 3), because negative values cannot 
be log-transformed. Pregnant and lactating mothers (m= 2 bonobo and 2 gorillas) 
were retained in the sample because ‘pregnant/lactating’ did not emerge as a signif- 
icant factor for TEE in multiple regression with FFM and genus (t(191) = —0.48, 
P=0.68). The experiments were not randomized. The investigators were not 
blinded to allocation during experiments and outcome assessment. 

Dosing and sampling. Dosing, sampling and analytical protocols for humans were 
as described previously'*. Ape protocols were as described previously’, and apes 
from that study (n= 14 Pan, n=3 Pongo, n=5 Gorilla) are included in this anal- 
ysis. Apes drank a sufficient dose of DLW (6% 7H30, 10% H,!80; Sigma-Aldrich) 
to achieve targeted enrichment for their body mass’’, which was measured using 
a digital scale before the study. Urine or saliva samples were collected once before 
dosing and then 3-5 times post-dose, over the course of 7-10 days. Isotopic enrich- 
ment (7H and '8O) was measured using isotope ratio mass spectrometry or cavity 
ring down mass spectrometry. Analysis of residuals indicates results were similar 
across laboratories and methods (Supplementary Fig. 2). 

Calculating TEE. Isotope depletion rates and dilution spaces were calculated via 
the slope-intercept method’. Three to five post-dose urine samples were ana- 
lysed for each subject, enabling us to identify and remove contaminated samples 
from analysis. Isotope depletion rates were used to calculate ape TEE (kcal day“) 
using equation 7.17 in ref. 12, using a food quotient of 0.95 (ref. 23) and a dilution 


LETTER 


space ratio of 1.04 following observed ratios in all ape species (1.04 + 0.02, n=56 
subjects; no effect of genus: F(53) =0.55, P=0.58 ANOVA). We also calculated 
TEE using equation 17.15 in ref. 12, which is tailored to human subjects and 
assumes human-like fractionation and insensible water loss. Ape TEE values 
using the human-tailored equation are 11 + 3% greater, but the pattern of TEE 
differences among hominoids is robust to the choice of equation (Supplementary 
Table 3). We prefer the more generalized mammalian equation for apes, since apes 
differ from humans in sweating capability and nasal morphology and are therefore 
likely to be more like other mammals (rather than humans) in fractionation and 
insensible water loss. 

TEE analysis. Variation in TEE was examined via multiple regression”® using 
In-transformed values for TEE, FFM and fat mass to reduce heteroscedasticity of 
residuals. Comparisons were made at the genus level due to the close phylogenetic 
relatedness of bonobos and chimpanzees. Bonobos and chimpanzees did not differ 
(F(32) =0.007, P=0.80 ANCOVA) in TEE in analyses controlling for FFM. 
Locomotor costs. Humans were fitted with triaxial accelerometers (Actical, 
Phillips Respironics) and counts per minute were logged for 7 days, correspond- 
ing with the DLW measurement period'”. Accelerometry was not feasible for apes. 
Instead, physical activity was recorded using direct observation for a minimum 
of 20h over the DLW measurement period. Distances walked and climbed were 
recorded every 15 min, using measurements of the enclosures to calibrate distance 
estimates. Rates of walking and climbing (metresh~') were calculated and multi- 
plied by 12 to give the estimated distance walked and climbed during the daytime. 
Night-time activity was not observable in apes and was therefore excluded from 
analyses for all species. 

BMR. Published respirometry measurements of BMR for chimpanzees 
(age: 2 months to 15 years) and comparably aged humans” (3-18 years) were 
In-transformed and analysed using multiple regression” controlling for sex, age, 
and In-transformed body mass. Humans had higher BMRs than chimpanzees 
(P< 0.001) in all analyses (Supplementary Table 3). These data, along with the 
few available data for orangutan BMR’"”, are plotted in Extended Data Fig. 2. 
Allometric regressions for chimpanzees and orangutans (Extended Data Fig. 2) 
were used to estimate BMR for the Pan and Pongo cohorts (Table 1). Published BMR 
prediction equations for adult humans”’ were used for the Homo cohort in Table 1. 
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26. R Core Team. R: A language and environment for statistical computing. 
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Extended Data Figure 1 | The human energetic paradox. Humans achieve greater reproductive output (g year” ’; black bar) and have larger brains 
(g; blue bar) relative to female metabolic mass (kg®”) than any of the great apes, yet also achieve longer lifespans (grey bar). Human data are from 
traditional hunter-gatherer and subsistence farming populations; ape data are from populations in the wild’ (see Supplementary Table 2). 
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Extended Data Figure 2 | BMRs for humans, chimpanzees and Resting metabolic rates (RMRs; kcal day~'), measured in alert orangutans 
orangutans. Available BMR data for chimpanzees’'® are primarily while sitting'’, are also shown. The top of the bar indicates the measured 
from juveniles (age: 2 months to 15 years), and are shown here against RMR for those individuals, and the bottom square indicates estimated 
comparably aged humans’ (3-18 years). Humans have greater BMRs BMR based on those measurements’’, assuming BMR = 0.8RMR. 

than chimpanzees in a general linear model of In-transformed BMR and No BMR data are available for gorillas. Symbols marked with black circles 
mass controlling for age and sex (P < 0.001; Supplementary Table 3). are males. Allometric regressions shown for Pan (y= 100.17x°) and 
Note that in humans (but not chimpanzees), males have greater BMRs Pongo (y= 218.61x°*”) and were used to estimate BMR for the adult 

for a given body mass, reflecting their greater proportion of FFM (that cohorts in Table 1. Human BMR in Table 1 was estimated from published, 
is, lower body fat percentage). The available data for orangutan BMR sex-specific predictive equations for adults”’. 
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consists of one juvenile individual’® (no age reported, mass 16.2 kg). 
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Deep-sea diversity patterns are shaped by energy 
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Skipton N. C. Woolley!, Derek P. Tittensor**, Piers K. Dunstan°, Gurutzeta Guillera-Arroita’, José J. Lahoz-Monfort?, 


Brendan A. Wintle*, Boris Worm? & Timothy D. O’Hara! 


The deep ocean is the largest and least-explored ecosystem on 
Earth, and a uniquely energy-poor environment. The distribution, 
drivers and origins of deep-sea biodiversity remain unknown 
at global scales'*. Here we analyse a database of more than 
165,000 distribution records of Ophiuroidea (brittle stars), a 
dominant component of sea-floor fauna, and find patterns of 
biodiversity unlike known terrestrial or coastal marine realms. 
Both patterns and environmental predictors of deep-sea (2,000- 
6,500 m) species richness fundamentally differ from those found 
in coastal (0-20 m), continental shelf (20-200 m), and upper-slope 
(200-2,000 m) waters. Continental shelf to upper-slope richness 
consistently peaks in tropical Indo-west Pacific and Caribbean 
(0-30°) latitudes, and is well explained by variations in water 
temperature. In contrast, deep-sea species show maximum richness 
at higher latitudes (30-50°), concentrated in areas of high carbon 
export flux and regions close to continental margins. We reconcile 
this structuring of oceanic biodiversity using a species—energy 
framework, with kinetic energy predicting shallow-water richness, 
while chemical energy (export productivity) and proximity to 
slope habitats drive deep-sea diversity. Our findings provide a 
global baseline for conservation efforts across the sea floor, and 
demonstrate that deep-sea ecosystems show a biodiversity pattern 
consistent with ecological theory, despite being different from other 
planetary-scale habitats. 

Deep-sea environments comprise approximately 66% of global 
sea-floor area, and hence more than half of the planet’s surface*. The 
sinking of biological material to the sea floor is a critical part of the 
global carbon cycle and climate. Yet global patterns of sea-floor diver- 
sity remain unknown, having so far been described only on local and 
regional scales*. Here we assemble a unprecedented data set on the 
global distribution of 2,099 Ophiuroidea (brittle and basket stars) 
species from shallow to abyssal depths, comprising 165,044 species 
distribution records from 1,614 research expeditions (Extended Data 
Fig. 1). Ophiuroidea are an ideal model taxon to analyse global pat- 
terns of species diversity as they are a dominant component of the 
fauna of many deep-sea habitats®. These data provide a unique oppor- 
tunity to uncover and compare deep-sea biodiversity patterns across 
three fundamentally different depth strata of the ocean, the continental 
shelf (20-200 m), upper continental slope (200-2,000 m) and deep-sea 
(2,000-6,500 m)”*. Furthermore, we propose that the deep sea can be 
viewed as a third replicate biome (after terrestrial and shallow-water 
diversity) to untangle the role of fundamental processes that shape 
global diversity. We analysed the three bathymetric strata separately, 
spatially estimated and mapped total species richness across a global 
grid using multi-species hierarchical occupancy-detection models 
(MSODM), and formally tested a number of prominent hypotheses 
on the factors shaping deep-sea diversity patterns using spatial linear 
models (see Extended Data Table 1) and a species—energy framework. 


Global patterns of species richness for shelf and upper-slope species 
are congruent with those of coastal marine species’. Both commu- 
nities show diversity peaks in the tropical Indo-west Pacific and the 
western Atlantic Oceans (Fig. la, b). However, in contrast to previous 
work, we find relatively high regional species richness around southern 
Australia and New Zealand (Fig. 1b)’. Species richness is generally 
suppressed on the western side of tropical America and Africa, and 
the northern Indian Ocean (Fig. 1a, b and Extended Data Fig. 2a, b). 
Deep-sea species richness shows a markedly different pattern, with 
peaks occurring predominantly at mid-to-high latitudes (Fig. 1c and 
Extended Data Fig. 2c), particularly across the boreal Atlantic Ocean, 
around Japan, New Zealand, western North and South America and 
western Africa. 

When global ophiuroid richness is examined by latitude and depth 
(Fig. 2), it peaks in the tropics at continental shelf (20-200 m) and 
upper-slope depths (200-1,200 m). A strong latitudinal biodiversity 
gradient exists at these depths with reduced richness at mid-to-high 
latitudes (>45°S and >55°N). These results are congruent with prior 
studies of shallow-water? and terrestrial’® global diversity gradients 
that suggest a unimodal diversity peak at low latitudes. However, 
at lower-slope to intermediate abyssal depths (2,000-6,500 m), 
bimodal maxima occur at temperate latitudes (30-40°S and 40-50°N), 
with distinct minima near the poles and at northern subtropical 
latitudes (15-30°N). Thus the typical latitudinal gradient of species 
richness observed near the planetary surface does not hold in the 
deep sea. 

We encode a priori hypotheses on processes expected to structure 
biodiversity’ by encapsulating them as potential drivers in a spatially 
explicit statistical model (see list of hypotheses in Extended Data 
Table 1). Geographical variation in energy availability (the species- 
energy hypothesis) is a factor thought to shape terrestrial and marine 
global biodiversity'’!*, through radiation (light), thermal (kinetic) 
or chemical (potential) energy. Unlike other realms, the first of these 
can be excluded from the aphotic deep-sea environment. Thermal 
energy may affect diversity through several mechanisms, including 
physiological tolerances, speciation/extinction rates, and availabil- 
ity of metabolic niches'*. Chemical energy in the form of reduced 
organic compounds is hypothesized to promote species diversity’; in 
the deep-sea this would be reflected by food resource availability man- 
ifested as particulate organic carbon (POC) flux. Non-energetic factors 
tested included oxygen stress, reflected on the upper slope by oxygen 
minimization zones (OMZs)"*; the environmental stress hypothesis 
proposes that species richness has a negative relationship with envi- 
ronmental stress’. Finally, long-term connectivity between shallower 
shelf and upper-slope species to deep-sea communities is expected to 
affect species richness'° via the regulation of deep-sea populations 
through extinction and radiation of species from connected regions’. 
Testing these hypotheses against patterns of deep-sea diversity helps 


1Museum Victoria, GPO Box 666, Melbourne, Victoria 3001, Australia. Quantitative and Applied Ecology Group, School of Biological Sciences, BioSciences Building 2, The University of Melbourne, 
Victoria 3010, Australia. 7Department of Biology, Dalhousie University, 1355 Oxford Street, Halifax B3H 4J1, Canada. “United Nations Environment Programme World Conservation Monitoring 
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Figure 1 | Global patterns of ophiuroid species richness. a—c, Multi- 
species occupancy detection models (MSODM) of summed occupancy 
probabilities at 500 km equal area resolution for shelf diversity 

(20—200 m; a), slope diversity (200—2,000 m; b) and deep-water diversity 


disentangle the environmental, ecological and historical forces shaping 
global diversity. 

Our statistical models revealed that the species—energy hypothesis 
is broadly supported at all depths, albeit through different forms of 
energy (Table 1). A significant relationship (P < 0.01) between rich- 
ness and sea-floor water temperature emerges at shelf and upper-slope 
depths, correlating with kinetic (specifically thermal) energy input 
from the sun. Strong thermal gradients are present in shelf and slope 
(but not deep-sea) regions, promoting greater species richness'*!®. 
However, there is a significant negative correlation between the diver- 
sity of shelf and slope environments and chemical energy, measured as 
POC flux to the sea floor (Table 1; POC, P< 0.05), probably because 
tropical shallow-water systems tend to be nutrient-poor. Conversely, 
deep-sea richness is not correlated with temperature but is signifi- 
cantly positively correlated with chemical energy export (Table 1; 
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(2,000—6,500 m; c). Land maps used in these figures are from the public 
domain Natural Earth and freely available for personal, educational, and 
commercial purposes. See http://www.naturalearthdata.com/about/ 
terms-of-use/ for more details. 


POC, P< 0.01) and regions of high seasonal surface productivity 
(SVI; P< 0.01). POC export is probably a key source of energy that 
maintains deep-sea species in regions of constant and low thermal 
energy” *'. The diversity of shelf communities within the model is 
also suppressed in OMZs (Table 1; oxygen stress, P< 0.05). These 
zones are dysoxic, with less than 2 ml O, per litre of seawater, and are 
known to have substantially reduced faunal diversity and biomass”. 
Thus environmental stress appears to play an additional role in influ- 
encing global patterns of ophiuroid richness. For deep-sea environ- 
ments, connectivity is also a significant predictor, with a decline in 
species diversity with distance from continental margins (Table 1; DC, 
P<0.05). This finding implies that the continental margins are a long- 
term source of abyssal diversity. This is consistent with the radiation 
hypothesis’° that predicts deep-water diversity is maintained by immi- 
gration from bathyal sources’’. 
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Figure 2 | Estimated mean ophiuroid species richness plot as a function 
of depth and latitude. a—c, Species richness predicted from MSODMs at 
depth intervals from surface to lower-abyss depths for binned equal-area 
latitudinal regions across the global extent of longitude. Mean species 
richness estimated from MSODMs for shelf diversity (20—200 m; a), slope 
diversity (200—2,000 m; b) and deep-water diversity (2,000—6,500 m; c). 


Observed patterns of deep-water diversity may shed some light on 
the drivers of large-scale gradients of diversity in other environments. 
Mannion et al.’* suggested that two different classes of latitudinal 
diversity gradients occurred over the history of the Phanerozoic eon 
(542 million years ago). A tropical maxima and polar minima existed 
during relatively cool ‘icehouse’ conditions*’ where there is a strong sea 
surface temperature divergence between equator and pole. Conversely, 
a flat diversity gradient or temperate peak occurred during warmer 
‘greenhouse’ conditions, when there was less of a temperature gradient, 
indicating that thermal energy was probably a key driver of geographic 
variation in richness. Here we observe no tropical peak in diversity of 
deep-water assemblages (2,000-6,500 m), suggesting that it is uniform 
temperatures, rather than hot ‘greenhouse’ conditions, that supress a 
low-latitude diversity peak. Our results imply that energy availabil- 
ity determines the latitudinal diversity gradient, but in the deep sea, 


The vertical dashed line represents the equator. The grey contour lines 
represent the top 20% of species richness for each bathome. Depths are 
binned at 50 m intervals for shelf regions (0-50, 50-100, 100-150 and 
150-200 m), at 200-m intervals for slope (200-2,000 m), and 500-m 
intervals for deep-water (2,000-6,500 m). 


unlike the rest of the ocean, this derives from chemical rather than 
thermal sources. 

Currently, we know little about the evolutionary processes that at 
least partially gave rise to bathyal and abyssal species diversity patterns. 
Multiple hypotheses postulate in situ diversification, or immigration 
from shallower depths'®*. It is unclear how the abyssal and lower 
bathyal fauna re-establish after extinction events such as anoxia”’. Our 
modelled estimates of species richness depict higher diversity on the 
upper slope; these results lend support to the upper slope being the 
source of deep-sea diversification. Observed patterns of species richness 
are highest near continents (Fig. 1), showing the relationship between 
deep-water diversity and connectivity to continental margins (Table 1). 
Thus our data and analyses lend support to the theory that the deep-sea 
fauna at least partially originates from range expansion of upper-bathyal 
species into the deep-sea. However, these suggested peaks of diversity 
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Table 1 | SLM for species richness of three bathomes 


2,000-6,500m 

20-200m 200-2,000m _ (lower slope 
Bathome (shelf) (upperslope) § and abyss) 
Species richness 126 110 all 
Annual mean temperature (AMO) 11.49*** 3.6154" 
Annual mean temperature —2.71** 
(AMO)? 
Annual mean oxygen (AMO) —2.17* 
Annual mean oxygen (AMO)? 
Seasonal variation in NPP (SVI) 3.54% Lois 
Seasonal variation in NPP (SVI)*?_ —2.48* 
Particulate organic carbon flux —4,43* —3.06* 3.097" 
(POC) 
Particulate organic carbon flux 2a —2.46* 
(POC)? 
Distance to continental margin n/a n/a 0.45* 
(DC) 
Distance to continental margin n/a n/a 
(DC)? 
Oxygen stress (OMZ) 
Oxygen stress (OMZ)? 1.71% 
Pseudo-R? 0.35 0.37 021 


Spatial linear model (SLM) results for the species richness of the three bathomes 20-200 m, 
200-2,000 m and 2,000-6,500 m are shown. Maximum species richness for each bathome 

is highest individual cell value. Model results are from the best SLM as determined by Akaike 
information criterion value. Model results are z values; *P < 0.05, **P < 0.01, ***P < 0.0001. 
Distance to continental margin is only applicable for deep-water (LSA; lower-slope and abyss). 
n/a, not applicable. 


are also regions of high export productivity, a strong energetic predic- 
tor of richness. To delineate better the processes shaping evolutionary 
origins of deep-sea fauna, comprehensive phylogenies are required. 
In conclusion, our findings reveal a unique global pattern of deep- 
sea benthic biodiversity that is unlike any other environment. We 
are able to reconcile the vertical structuring of marine biodiversity 
through a species—energy framework, a fundamental theory of the 
origination of biodiversity. Our findings also support the radiation 
hypothesis, suggesting deep-water richness is maintained by immigra- 
tion from shallower regions. These results have important implications 
for identifying potential protected areas on the high seas, both within 
and outside national jurisdictions. Tropical areas are typically highly 
diverse in shallow waters and on land, and thus often the focus for 
conservation efforts. In contrast, areas of higher export productivity 
and connectivity to shallower communities will need to be considered 
for conservation and management action in the deep-sea. Our results 
provide a much-needed empirical and spatial baseline for global con- 
servation planning in the deep ocean, which is urgently motivated by 
the accelerating pressures from deep-sea fishing, mining and other 


cumulative impacts on this vulnerable region”®. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Data. Biological data. Global brittle-star occurrence data (84° N to 78°S latitude 
and 180° W to 180° E longitude) was derived from 1,614 research expeditions, cov- 
ering a 130 year timespan, starting with iconic nineteenth century voyages such as 
the Challenger expedition’. Brittle-star species occurrence records were collected 
from three major bathomes: shelf (SH; 20—200 m), slope (SL, 200—2,000 m) and 
deep-water (lower slope and abyssal plane; LSA, 2,000—6,500 m). These depth 
strata were selected to reflect existing biogeographical bounds of bathyal ophi- 
uroids®, Ophiuroid occurrences at hadal depths (>6,500 m) were removed as the 
data were very sparse and would probably result in fragile inference of patterns 
at these depths. Ophiuroidea identifications were verified by taxonomic experts 
to species level (including author T.O.). Specimens were collected using ad hoc, 
semi-quantitative and quantitative methods, including trawls, dredges, epiben- 
thic sleds, grabs and corers. The highest density of ad hoc samples (for example, 
collections by hand) corresponded to coastal occurrence records (0O—20 m) and 
were subsequently removed from analyses to minimise potential collection bias”®. 
The spatial extent and proportion of survey effort per 500-km cell were plotted to 
visualize spatial bias in collection effort (Extended Data Fig. 1). 

Environmental data. Environmental and physical predictors were used to test 
hypotheses that seek to explain patterns of deep-sea species richness (see Extended 
Data Table 1 for a summary of hypotheses name, meaning, relevance, origin and 
related predictors). Environmental predictors were tri-linearly interpolated to the 
sea floor using global ETOPO1 ice-surface GIS bathymetric data set”®. Annual 
mean sea-floor temperature (°C) (AMT), annual standard deviation of sea-floor 
temperature (°C) (ASDT) and annual mean oxygen (ml 1~') (AMO) were derived 
from the CARS 2009 data set*?*!. The CARS climatology physical oceanography 
data (1950-2009) were interpolated across the globe for 79 depth layers at a reso- 
lution of 0.5° latitude/longitude. We also calculated the proportion of AMO grid 
cells that had <2 ml per litre Op, a critical physiological limit for numerous marine 
species* and typically the threshold for oxygen minimization zones (OMZs)". 
Mean annual net primary productivity (gC m? year~!, NPP) and seasonal var- 
iation of net primary productivity (g C m~? year '; SVI) were generated from a 
vertically generalized production model (VGPM)**. NPP and SVI are a function 
of satellite-derived chlorophyll (SeaWiFS). NPP and SVI were calculated across the 
years 2003 to 2010 (see http://www.science.oregonstate.edu/ocean.productivity/). 
Particulate organic carbon flux to the sea floor (POC flux; g C m~ year!) was 
estimated using NPP and SVI data and a productivity export model**. Distance 
from continental margins (DC) for deep-sea habitats was estimated based on the 
IFREMER Continental margins shape file**. We created a spatial layer that calcu- 
lates distance of sea-floor habitat to nearest point on the 2,000 m contour around 
continental margins and islands. 

For the statistical analyses, environmental predictors were averaged to cell size 

across the three bathomes (20-200 m, 200-2,000 m and 2,000-6,500 m). Strongly 
correlated variables (>0.7) were removed from analyses to avoid issues with co- 
linearity of model coefficients (Extended Data Table 2). AMO was removed from 
the shelf analysis, due to its correlation with AMT. We removed NPP from analyses 
due to its correlation with POC flux. We selected POC flux over NPP as we were 
interested in the amount of carbon flux at the sea floor, rather than the surface. 
All independent variables used in statistical analyses were centred and normalized 
(mean = 0, variance = 1). All analyses were performed at spatial scales of 500 km 
equal area grid cells. 
Statistical analysis. We were interested in describing patterns of species richness 
and the processes that shape observed patterns in the deep-sea benthos. Many 
authors have approached these analyses using either bottom-up**” or top-down 
methods”. The respective merits of both approaches are still debated in the eco- 
logical literature*®. We see merits in both approaches, using them for different 
purposes. 

For a top-down approach we linked estimates of species richness derived from 
the estimated asymptotes of species accumulation curves to environmental and 
physical data using spatial regression models (SLMs). This assumes that the envi- 
ronment is likely to impose top-down limits of species richness, independently 
of species identities. Critically, unlike the species distribution modelling (see 
below), we estimated the number of species in a region (cell) independently of 
environment predictors, thus enabling us to assess potential determinants of 
richness in our modelling framework without circular reasoning. We therefore 
use this approach to test hypotheses of processes that shape global deep-sea 
species richness. 

Our second approach used the summation of species distribution models to 
assess species richness. Here we used an extension of classic species distribution 
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models that incorporates detection probabilities when assessing the distribution 
of modelled species. Details of our two approaches are discussed in the following 
section. 

Spatial linear models (SLMs). Estimation of richness via species accumulation 
curves. The ‘coverage-based rarefaction’ method*? was used to estimate species 
richness on a cell-by-cell basis. This method estimates species richness based on 
a measure of sample completeness*’. The aim is to estimate the ‘sample deficit’, 
which represents the fraction of the community that remains undiscovered”. This 
is a novel alternative to species accumulation curves based on the extrapolation of 
individuals or samples’, and one which attempts to scale the richness of each cell 
to an equivalent level of sampling coverage for all cells. For our maps of diversity, 
we used a 75% coverage-based estimate of the number of species per cell as a 
conservative balance between extrapolation and completeness of sample coverage. 
For each cell we ran the estimator with 1,000 bootstraps, and took the mean as 
our point estimate of species richness for each cell. To assess the performance of 
cell-by-cell estimation of the number of species, we plotted all estimated species 
accumulation curves and their respective bootstrap bounds for each cell and vis- 
ually assessed the curvilinear nature of each extrapolation. Species accumulation 
curves that (1) did not show asymptotic behaviour or (2) had extreme confidence 
bounds based on bootstrapping, were removed from further analysis. Species 
richness interpolations and extrapolations were calculated using the R package 
Vegan” and code adapted from the package iNEXT™. 

Modelling of estimated richness as a function of environmental predictors. Estimated 
species richness was used as a response variable in models that tested hypotheses 
about its relationship to environmental predictors. We used SLMs that explicitly 
account for spatial autocorrelation (Extended Data Table 3), specifically simulta- 
neous autoregressive models (SARs)**. Neighbourhood size was selected using an 
error-SAR process, based on the minimum AIC for spatial null models (model con- 
taining the intercept and the spatial autocorrelation term). Neighbourhood sizes 
between 1,000 km and 10,000 km were tested at 100-km intervals. Neighbourhood 
size was determined independently for each depth strata, as it was expected that 
different bathomes would display differing extents of spatial autocorrelation due to 
different ecological and evolutionary process driving the spatial patterns of species 
richness. We used an all-model selection method to find the AIC-best model. 
We analysed the models and the relative importance of predictors through z-tests 
(SLMs). We used pseudo-R? to assess model fit. We fitted linear, and second-order 
polynomial functions for each predictor variable given that a number of studies 
have emphasized the importance of unimodal relationships with temperature* and 
POC flux? (Extended Data Fig. 3). Models were fitted using the errorslm function 
in spdep** package in R. 

Multi-species occupancy-detection models (MSODMs). The second approach 
for analysis involved MSODMs*’™, a relatively novel but promising community- 
modelling framework that allows flexible consideration of species distributions 
and their detectability. This modelling framework is grounded in the view that 
species richness and other attributes of community structure are best described 
using models of individual species occurrence that explicitly account for imper- 
fect detection during sample collection®'. This framework thus explicitly 
deals with potential biases in sampling effort, as those expected in our deep-sea 
species data. 

MSODMs provide a hierarchical and explicit description of the state (species 
occurrences) and observation (species detection) processes. At the heart of the 
approach is the estimation of the incompletely observed site-by-species occurrence 
matrix, from which different summaries of community structure can be derived. 
The presence or absence of species i at a site j is described as the outcome of a 
Bernoulli trial 


Zi ~~ Bernoulli(y;) 


where 7); is the probability that species i is present at site j, and the latent variable 
Z; represents whether the species is present or not at the site (Zj takes value 0 or 1). 

The observation model describes the observed data as the outcome of a series 
of independent Bernoulli trials with probability Px at sites where the species is 
present (Z;;= 1) and 0 elsewhere, that is, 


Yijk|Zij ~ Bernoulli(Z;j Pj) 


where Yj are the observed data (detection/non-detection of species i at site j 
during survey k), and pj are the corresponding species detection probabilities 
(the probability of detecting species i at site j during survey visit k). The model 
assumes that the occupancy status of cells (Z;;) do not change during the survey 
period, which is a reasonable assumption at the geographical scale of our analysis 
and time frame of the data collection. Occupancy and detection probabilities can 
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then be modelled as a function of relevant environmental predictors following the 
generalized linear modelling framework, for example, 


logit(w) = Go; + G1; x covariate;; +... + B,; x covariate nj 


where n is here the number of predictors in the occupancy component of the model 
(including quadratic terms, interactions, and so on). In our model, occupancy 
probability was described as a function of the 12 to 14 covariates (depending on 
the depth strata) using linear and quadratic terms (Extended Data Figs 4 and 5). We 
ran a single model with all covariates and considered covariate contribution, rather 
than using model selection. Detection probability was described as a function of 
the collection method (for example, dredge or grab) used in each collection event 
(that is, survey visit k at site ) 


logit (p;i,) = oi + Oj X Geary, 


In the MSODM framework, individual species models are linked through random 
effects in a hierarchical fashion, this way exploiting similarities in environmental 
responses to borrow information across species. This is achieved by describing the 
parameters from species-specific models as realizations from a common distri- 
bution, whose parameters (or ‘hyperparameters’) are estimated. For our analysis, 
parameters were described using independent normal distributions as follows 


Ba~ N(Hg03,) and au~ N( Ho. %a,) 


Once a MSODM is fit, species richness and other metrics of community structure 
can be derived based on the parameter estimates obtained. In particular, species 
richness is simply obtained by summing the estimated occupancy probabilities 
across species. The estimated species richness at site j (N;) is thus calculated as 


I 
N= D5 
i=1 


that is, the species richness estimate at site j is equal to the expected number of 
species at the site. 

The MSODM framework allows inference about the number of species that 
were completely missed during sampling*”**. In our analysis however, estimation 
was restricted to species with a minimum of ten occurrences because of computa- 
tional limitations on estimating rare and undetected species in large species by sites 
matrices. We fitted separate MSODMs to the three bathomes, shelf (320 species), 
upper slope (440) and deep-sea (58) (Fig. la-c). 

Our MSODM model makes two key assumptions: cell sites are visited multiple 
times over a period of population closure during which the occupancy status (Zix) 
of a site does not change. This assumption is likely to be violated at small spatial 
scales, however as we are estimating occupancy at large scales, it is possible to 
assume that occupancy in the 500-km cells should not change over the time scale 
of these data collections. The second assumption is that species identification is 
constant across the data set with no false-positive identifications in the data. The 
data used in this analysis was verified by a taxonomic expert (T.O.) from museum- 
based records or the taxonomic literature to ensure consistent identification across 
the whole data set. The spatial MSODMs for shelf, slope and deep-water assem- 
blages are presented in Fig. la—c. We can also assess the uncertainty in spatial 
predictions of occupancy probabilities across all species in the MSODMs, we do 
this by presenting the mean variance in MSODM predictions for shelf, slope and 
deep-water diversity. The variance predicted occupancy probability is calculated 
spatially for each species and the mean variance across all species per cell (Extended 
Data Fig. 6a, b). 

Models were fitted using JAGS**”?, a program for Bayesian inference using 
Markov chain Monte Carlo (MCMC). JAGS was controlled via an R script using 
package R2jags**. Three chains were run with different initial values, a burn-in of 
2,000 iterations and a minimum of 20,000 iterations with a thinning by 50. Model 
convergence was assessed using the R (‘R-hat) statistic®>. We present parameter 
posteriors distributions for covariate estimates, which represent the distribution 
of all species response to each covariate. We also present the 10th—90th percentile 
of species partial response to covariates as a function of occupancy (1) for each 
bathome (Extended Data Fig. 4), we also report the mean posterior distributions 
of parameter estimates for all species (Extended Data Fig. 5). 

We also compared deviances of null (intercept only) and full covariate models 
for each bathome. Table of deviances, DIC and pD (an estimate of deviance relative 
to variance) are presented in Extended Data Table 4. Bespoke C++ code, written 
using C++ and Armadillo C++ linear libraries°®, which was integrated into the 
R environment using Rcpp”’ and ReppArmadillo”’, was used to predict the occu- 
pancy of species individually from fine-scale environmental data. The code uses a 
500 km moving window to estimate fine scale probabilities based on the original 


500 km resolution of the original MSODMs. This essentially smooths predictions 
to be representative of the original cell size estimates. 

Figure 2 was derived from MSODMs, by predicting the probability of occupancy 
for each species at a series of depth bounds. For the shelf we predicted the proba- 
bility of our 320 species at 50-m depth intervals (0-50, 50-100,100-150 and 
150-200 m). For slope species we broke up the environment into 200-m depth 
intervals (200-2,000 m). While for abyss and lower slope we broke up the depth 
bands into 500-m intervals (2,000-6,500 m). The estimated species richness at site 
j (Nj) is thus calculated for each 500-km cell at each depth layer. We then took the 
mean of Nj for each latitudinal band across the global prediction. 

All analyses were undertaken in the R statistical language version 3.0 (ref. 59). 
Details about the packages and functions used are given under each section (we 
have made our JAGS model code publicly available on GitHub: https://github. 
com/skiptoniam/msodm_code). Spatial predictions were plotted using ArcMap 
v10 (ref. 60) and R spatial packages (Rgdal®!, raster, Maptools® and dismo™). 
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(a) (d) 


Extended Data Figure 1 | Distribution of global sampling effort across effort from 2,000-6,500 m (c). d-f, Ophiuroid distribution data are 


deep-sea bathomes. a-c, Spatial plots of sampling effort for ophiuroid presented for shelf (d; red; 20-200 m), slope (e; orange; 200-2,000 m) and 
occurrence data at the same equal-area grid cells used in MSODM at deep-water (f; yellow; 2,000-6,500 m); key represents depth profile. Land 
500-km equal area grid cells, maximum effort is capped at 100 surveys to maps used in these figures are from the public domain Natural Earth and 
help visualize low and high regions of repeated effort. Shelf effort from freely available for personal, educational, and commercial purposes. See 


20-200 m (a), slope effort from 200-2,000 m (b) and deep-water collection —_http://www.naturalearthdata.com/about/terms-of-use/ for more details. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 2 | Model estimated global deep-sea species and absences of species within each cell. The estimated count of species 
richness across different depth strata. a~c, Maps of species counts (N) as — (N) per 500km equal area grid cell are reported in each figure key. Land 
calculated using MSODM are presented as shelf (a), slope (b) and deep- maps used in these figures are from the public domain Natural Earth and 
water species (c). N is an estimate of species present per cell based on our freely available for personal, educational, and commercial purposes. See 
occurrence matrix (Z). Z is a latent variable used to calculated presences http://www.naturalearthdata.com/about/terms-of-use/ for more details. 
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Extended Data Figure 3 | Linear partial residual plots as derived from SLMs. a-i, Partial residual plots for significant variables included in the models 
for global deep-sea richness at shelf (a—c, 20-200 m), upper-slope (d-f, 200-2,000 m) and deep-water (LSA; g-i, 2,000-6,500 m). Hatched lines are partial 
fits (red lines). Values on the x axis are centred and normalized (mean = 0, variance = 1), as derived from SLMs. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


f 
co co we co co co 
oS SS oO = oS oO 
pus el ae = + ae 
oO j=) oO oO j=) j=) 
2° 2° 2° ° ° ° 
j=) j=) oO oO j=) (=) 
-1.5 0.0 1.0 -1 123 -3 -1 i 2 -2 C12as -1 1234 oO 8S 10 15 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP _—_‘ Particulate Organic Carbon Flux Oxygen Stress 
Seafloor Temperature (C°) Temperature (C°) (AMO; m/l) (SVI; g C m-2 year-1) (g C m-2 year-1) (OMZ; 02% < 2 ml/l) 
h | 
« io © © oo © 
SE j=) oO oO 1) (= 
a a a! pe Sh pe 
oO oO [o) oO oO oO 
° 2 ° ° ° ° 
fo) j=) fo) oO jo) j=) 
-1.5 0.0 1.0 -1 1283 -3 -1 a, 22 -2 Oo1283 -1 1234 0 5 10 15 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP? articulate Organic Carbon Flux | Oxygen Stress 
Seafloor Temperature ey Temperature ory (AMO; ml/l) (SVI; g C m-2 year-1) ‘9 C m-2 year-1) ‘i (OMZ; 02% <2 mill) 
@ r 
2 2 © © i ee 
= j=) oO oO oS j=) 
St = St = = = 
oO j=) oO oO j=) j=) 
2° ° 2° °o icici sata 2° 2° acerca 
oO j=) oO oO = | j=) 
-1 1 3 02 4 6 2 -1 0 1 -2 O 2 4 0246 8 0 5 10 “tb 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP Particulate Organic Carbon Flux Oxygen Stress 
Seafloor Temperature (C°) Temperature (C°) (AMO; m/l) (SVI; g C m-2 year-1) (g C m-2 year-1) (OMZ; 02% < 2 mi/l) 
5 t U Vv Ww X 
oo co oe co co co 
oS S oS oO ( = | oO 
St a. = me ae = 
ae} =) oO oO oO j=) 
° ee 2 2 ° ° 2 
oO oO oO oO oO oO 
-1 1 3 02 4 6 2 -1 0 1 -2 O 2 4 0246 8 0 5 10 15 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP articulate Organic Carbon Flux . Oxygen Stress 
Seafloor Temperature er Temperature icy (AMO; ml/l) (SVI; g C m-2 year-1) ‘g C m-2 year-1) . (OMZ; 02% <2 mill) 
z aa bb cc dd ee 
oO © co oo oo co in 
=) [= | a | oO | } j=) oO 
St = St = SS = St 
=) oO So Cc oa cS o 
° ° ° ° cases | ° ° ° a 
oO [=] oO oO oO j=) = 
-2 Z2AE S&S 0 2 4 & -2 O41 2 2 0 2 4 0246 8 -1 123 Aa 0 4 8 12 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP _—‘ Particulate Organic Carbon Flux Distance from Oxygen Stress 
Seafloor Temperature (C°) Temperature (C°) (AMO; ml/l) (SVI; g C m-2 year-1) (g C m-2 year-1) Continental Margin (m) (OMZ; 02% < 2 mi/l) 
ff go hh ii ij kk Il 
oo co oe io © co © 
Le } oS oO oO oO (=) oO 
= + Se = ps = st 
oO [=] oO oO | = (=) oO 
o | _éifi. a 2 —_ w | © _|_ dll aeane 2 o _|mettie... 
oO [=) oO oO oS (=) (=) 
0246 8 202 4 6 2-1 0 1 2 -1 423 4 O2A ES 0123 4 0 4 8 12 
SD of Seafloor Oxygen Concentration Seasonal variation of NPP articulate Organic Carbon Flux - Distance from Oxygen Stress 
Seafloor Temperature (ey Temperature ey (AMO; ml/l) (SVI; g C m-2 year-1) ‘9 C m-2 year-1) Continental Margin (m)? (OMZ; 02% <2 mill)? 


Extended Data Figure 4 | Environmental relationships covariate 
estimated with the multi-species occupancy-detection model. The 
shaded areas represent the regions delimited by the tenth-ninetieth 
percentiles of the estimates obtained from the responses of all species. 


From top to bottom, rows display the estimates of occupancy (w), for shelf 
(green), slope (blue) and abyss (red) species. All covariates were centred 
and normalized (mean = 0, variance = 1). 
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Extended Data Figure 5 | Bayesian posterior estimates. a—-c, Deep-water MSODM parameter estimates, for shelf (a), slope (b), and deep-water species (c). 
Posterior distributions of parameter estimates are across all species. All covariates were centred and normalized (mean =0, variance = 1). 
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Extended Data Figure 6 | Mean variance of MSODM predictions of domain Natural Earth and freely available for personal, educational, and 
species occupancy probabilities. Variance for shelf diversity commercial purposes. See http://www.naturalearthdata.com/about/terms- 


(a, 20-200 m), slope diversity (b, 200-2,000 m) and deep-water diversity of-use/ for more details. 
(c, 2,000-6,500 m). Land maps used in these figures are from the public 
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Extended Data Table 1 | Encapsulation of species richness hypotheses by environmental and physical predictors 


Hypotheses 


Kinetic 
(thermal) 
energy 
Hypothesis 


Productivity- 
richness 
hypothesis 


The 
environmental 
stress 
hypothesis 


Evolutionary 
rates 
hypothesis 


Source-sink 
Hypothesis 


Predictors 


Annual Mean 
Seafloor 
Temperature 
(AMT; C°) 


Annual 
Standard 
Deviation of 
Seafloor 
Temperature 
(ASDT; C°) 


Modelled 
Particulate 


Organic Carbon 


Flux (POC flux; 
g C m” year’) 


Modelled 
Seasonal 
variation of net 
primary 
productivity 
(SVI; g C m-2 
year-1) 


Oxygen 
Minimization 
Zones (OMZs), 
Proportion of 
grid cells with 
AMO 
concentrations 
<2 mill. 


Annual Mean 
Oxygen 
Concentration 
(AMO; mi/l) 


Distance from 
Coast/Continent 
al Margin 


Predictor Derivation 


CARS 2009 dataset. 
Interpolated available 
oceanographic data 
(from 1950-2009) 
across the globe for 
79 depth layers at a 
resolution of 0.5° 
latitude/longitude. 
averaged 


Particulate organic 
carbon flux to the 
seafloor (POC flux; g 
C m-2 year-1) was 
estimated using NPP 
and SVI data and 
productivity export 
models. 


SVI is a function of 
satellite-derived 
chlorophyll 
(SeaWiFS). NPP and 
SVI were calculated 
across the years 
2003 to 2010. 

CARS 2009 dataset. 
Interpolated available 
oceanographic data 
(from 1950-2009) 
across the globe for 
79 depth layers at a 
resolution of 0.5° 
latitude/longitude. 
averaged 

CARS 2009 dataset. 
Interpolated available 
oceanographic data 
(from 1950-2009) 
across the globe for 
79 depth layers at a 
resolution of 0.5° 
latitude/longitude. 
averaged 

We estimated 
distance from 
continental margins 
using IFREMER 
Continental margins 
shape files. 


References 12-14, 16-19, 65, 66 are cited in this table. 


Hypothesis Description 


Warmer temperatures 
lead to higher richness. 
Possible mechanisms 
include (but not limited to): 
(i) increased metabolic 
rates leading to increased 
rates of speciation / 
decreased extinction rates 
(ii) warmer environments 
are easier to tolerate 
physiologically; (iii) 
warmer environments 
provide more metabolic 
niches. 


This hypothesis’ predicts 

a positive effect of primary 
productivity on species 
richness. Theory and 
empirical studies also 
support the idea that 
seasonal pulses of 
productivity can sustain 
diverse populations. 


Predicts a negative 
relationship with 
environmental stress and 
richness. Low oxygen 
concentration is an 
important environmental 
stressor in marine 
systems. 


Oxygen increases 
mutation rates, but directly 
contributing to increasing 
metabolic rates and 
production of free radicals 
and oxygen reactive 
species. 


Source-sink hypothesis 
predicts that species 
assemblage of abyssal 
species form a source- 
sink system in which 
abyssal populations (sink) 
are regulated by 
immigration from bathyal 
sources. 


Expected 
pattern 
anticipated for 
bathome(s) 


TO.12, 13 


Slope; Deep- 
Water 


112,19,65 


Shelf; Slope 


Shelf, Slope, 
Deep-Water. 


13,66 
1617 
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Extended Data Table 2 | Correlations between environmental predictors used in GLMs, SLMs and MSODMs by bathome 


Bathome 
AMT _ASDT___ AMO NPP POC 

AMT -0.780 -0.228 0.000 0.195 
ASDT -0.180 0.171 0.408 0.115 
AMO 1.000 -0.015 -0.251 -0.492 
NPP -0.237 0:357 0.939 0.338 
POC SVI -0.015 1.000 0.591 0.197 
POC -0.251 0.591 1.000 0.360 
OMZ -0.492 0.197 0.360 1.000 
DC - - 


AMT -0.023 0.152 0.214 
ASDT 0.347 0.522 0.118 
AMO 0.093 -0.056 -0.497 
NPP 0.517 0.899 0.220 
POC SVI 1.000 0.599 0.207 
POC 0.599 1.000 0.188 
OMZ 0.207 0.188 1.000 
DC - - 


AMT -0.210 0.150 0.076 -0.112 
ASDT -0.210 0.119 0.020 0.078 
AMO 1.000 0.001 -0.341 -0.113 
NPP -0.044 0.904 0.177 -0.353 
POC SVI 0.182 0.520 0.182 -0.393 
POC 0.001 1.000 0.178 -0.375 
OMZ -0.341 0.178 1.000 -0.149 
DC -0.113 -0.375 -0.149 1.000 


Correlations with an absolute value of greater than 0.7 are highlighted as bold. 
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Extended Data Table 3 | Top SLMs based on AIC under all model selection for each bathome (AAIC of <2) 


|Modeh SS 
|Variable Selection C—C“‘(CSSOOOOOCOCOC#C#C“(SNN’NNNNNNNNNNN | _—s | AIC |_—__—R?-|_ Moran's P-value | 
Shelf Lo 
| 1+ AMT + AMT? + OMZ+OMZ?+SVI+SVi?2+POC+POC?C“‘“LSSOC'«* | 242.351 | 0.362[ 0.188 
Slope 
1_+ AMT + AMT? + POC + POC? 


1 + AMT + AMT? + AMO + AMO? + POC + POC? 2591.206 | 0.373 0.658 


Deep-Water (LSA) a a 
1 + SVI+ SVE + POC + POC* 
1+ AMT + AMT? + POG + POC? 


We present model covariates including linear and quadratic terms, the number of parameters (k), Akaike information criterion (AIC), R¢ and Moran’s P value for each model. 
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Extended Data Table 4 | Deviance reduction between null multi-species occupancy detection models and fully fitted models 


Bathome Null Deviance Full Model Null pD Full Model pD | A Deviance Deviance 
Deviance Reduction 


Shelf 109583.51 (BCI: | 98126.56 (BCI: | 101050.02 10412.05 11456.93 11.7% 
109247 .72,1101 | 97824.14, 9844 
62.04) 7.66 


Slope 173525.87 (BCI: | 167309.26 (BIC | 847841.31 13329.09 6216.607 3.6% 
172516.77, : 167043.31, 16 
176573.34) 7560.76) 
Deep-water 19133.33 (BCI: 13658.34 (BCI: | 130153.60 4314.64 5474.98 28.6% 
(LSA) 18368.61, 13480.75, 
201 47.84) 13842.62) 


Estimates are presented with Bayesian confidence interval (BCI) for hierarchical multi-species occupancy detection models. pD is a Bayesian statistic that measures deviance, it is represented as: 
pp = acerene), which is calculated in JAGS software. 
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Local fitness landscape of the green 


fluorescent protein 


Karen S. Sarkisyan!**:+*, Dmitry A. Bolotin!?*, Margarita V. Meer*”, Dinara R. Usmanova***, Alexander S. Mishin!?, 
George V. Sharonov!’’, Dmitry N. Ivankov**’, Nina G. Bozhanova!, Mikhail S. Baranov!’, Onuralp Soylemez*”, 

Natalya S. Bogatyreva*>®, Peter K. Vlasov*”, Evgeny S. Egorov!, Maria D. Logacheva?!?"", Alexey S. Kondrashov!)”, 
Dmitry M. Chudakov!?, Ekaterina V. Putintseva!’, Ilgar Z. Mamedov!’, Dan S. Tawfik'’, Konstantin A. Lukyanov!? & 


Fyodor A. Kondrashov*"!4 


Fitness landscapes!” depict how genotypes manifest at the 
phenotypic level and form the basis of our understanding of many 
areas of biology” ’, yet their properties remain elusive. Previous 
studies have analysed specific genes, often using their function as a 
proxy for fitness”*, experimentally assessing the effect on function of 
single mutations and their combinations in a specific sequence”>* 
or in different sequences”*»!® 18, However, systematic high- 
throughput studies of the local fitness landscape of an entire 
protein have not yet been reported. Here we visualize an extensive 
region of the local fitness landscape of the green fluorescent protein 
from Aequorea victoria (avGFP) by measuring the native function 
(fluorescence) of tens of thousands of derivative genotypes of avGFP. 
We show that the fitness landscape of avGFP is narrow, with 3/4 of 
the derivatives with a single mutation showing reduced fluorescence 
and half of the derivatives with four mutations being completely 
non-fluorescent. The narrowness is enhanced by epistasis, which 
was detected in up to 30% of genotypes with multiple mutations and 
mostly occurred through the cumulative effect of slightly deleterious 
mutations causing a threshold-like decrease in protein stability 
and a concomitant loss of fluorescence. A model of orthologous 
sequence divergence spanning hundreds of millions of years 
predicted the extent of epistasis in our data, indicating congruence 
between the fitness landscape properties at the local and global 
scales. The characterization of the local fitness landscape of avGFP 
has important implications for several fields including molecular 
evolution, population genetics and protein design. 

We assayed the local fitness landscape of avGFP by estimating the 
fluorescence levels of genotypes obtained by random mutagenesis 
of the avGFP sequence (Fig. 1). We used fluorescence-activated cell 
sorting (Supplementary Fig. 1) and sequenced the entire GFP coding 
region to assay the fluorescence of many thousands of genotypes cre- 
ated by random mutagenesis of the wild-type sequence (Supplementary 
Information 2 and Extended Data Fig. 1). We applied several strategies 
to minimize the error of our estimate of fluorescence (Supplementary 
Information 3.4 and 4.4), which was estimated from thousands of 
independent measurements of the wild-type sequence (false negative 
error rate = 0.08%) and genotypes incorporating mutations known to 
eliminate fluorescence (false positive error rate = 0.24%). Our final data 
set included 56,086 unique nucleotide sequences coding for 51,715 
different protein sequences. Our procedure introduced an average 


of 3.7 mutations per gene sequence, and most assayed genotypes 
contained several, up to 15, missense mutations. Still, since the total 
number of possible sequences grows exponentially with the number 
of mutations, the fraction of sampled sequences was tiny for sequences 
containing more than two mutations (Extended Data Table 1). We used 
these data to survey the local fitness landscape of GFP, analysing the 
effect of single, double and multiple mutations. 

The distribution of fitness effects of individual missense mutations 
was assayed by comparing the distribution of fluorescence of wild- 
type avGFP amino acid sequences, tagged by different molecular 
barcodes, and the distribution of fluorescence of sequences carrying 
a single mutation (Supplementary Information 4.1). We found that 
at least 75% of mutations had a deleterious effect on fluorescence, 
including 9.4% of single mutations conferring a more than fivefold 
decrease in fluorescence, but for many mutations the effect was small 
(Fig. 2a). Accordingly, genotypes with multiple missense mutations 
were more likely to have low fluorescence, and most genotypes carrying 
five or more missense mutations were non-fluorescent (Extended Data 
Fig. 2). Mutations with a strong effect on fluorescence preferably 
resided at sites that coded for amino acid residues oriented internally 
towards the chromophore (Fig. 2b, c), which is consistent with data 
on other proteins on the preference of deleterious mutations to tar- 
get buried residues”'!~'’. The effect of mutations on fluorescence was 
positively correlated with site conservation (Extended Data Fig. 3a, 
Spearman's rank correlation coefficient = 0.40, P= 1.44 x 10~"°), and 
mutations with a deleterious impact were less likely to be found in 
orthologous sequences (Extended Data Fig. 3b) and more likely to 
be found in sites with a deleterious deletion (Supplementary Fig. 2). 
Still, ~10% of mutant states conferring a non-fluorescent pheno- 
type were nevertheless fixed in long-term evolution (Extended Data 
Fig. 3b), and a substantial fraction of genotypes containing only 
mutations leading to amino acid states from GFP orthologues was 
non-fluorescent (Supplementary Fig. 3), indicating that epistasis affects 
the avGFP fitness landscape”. 

Interaction of deleterious mutations can manifest in positive epista- 
sis, when the joint effect of mutations is weaker than their independent 
contribution, or negative epistasis when the joint effect is stronger 
(Fig. 3a). Light intensity is perceived in the logarithmic scale by living 
beings, including jellyfish'’. Thus, we defined epistasis e as the devia- 
tion from additivity of effects of single mutations on the logarithmic 
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Figure 1 | Exploring the local fitness landscape. a, Wild-type avGFP 
(centre) and most single mutants (innermost circle) fluoresce green. 
Genotypes with multiple mutations may exhibit negative epistasis, with 
combinations of neutral mutations creating non-fluorescent phenotypes 
(grey), or positive epistasis, in which a mutation in a non-fluorescent 
genotype restores fluorescence. b, The GFP sequence arranged in a circle, 
each column representing one amino acid site. In the first circle, the colour 
intensity of the squares indicates the brightness of a single mutation at the 


scale. We compared the decrement of the log-fluorescence of a multiple 
mutant Fy to the sum of decrements of individual mutants, such that 
é = (Fnutt — Fwr) — >; (Fi- Fwr), where Fwr and F; are the log- 
fluorescence values conferred by wild-type avGFP and avGFP with the 
i-th single missense mutation, respectively. We restricted the expected 
fluorescence of the multiple mutant 57, (F; — Fwr) to the observed 
maximum or minimum levels (Supplementary Information 4.2). This 
eliminated spurious detection of epistasis, which could otherwise 
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Figure 2 | The effect of single mutations on avGFP. a, The distributions 
of independently measured fluorescence for 2,442 wild-type sequences 
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corresponding site relative to the wild type, shown in the centre. Sites with 
positive and negative epistatic interactions between pairs of mutations 

are connected by green and black lines, respectively. In circles further 
away from the centre, representing genotypes with multiple mutations, 
the fraction of the column coloured green (black) represents the fraction 
of genotypes corresponding to high (low) fluorescence among all assayed 
genotypes with a mutation at that site. Scissors indicate the restriction site. 


occur, for example, when a non-fluorescent double mutant consists of 
two mutations, both of which individually confer a non-fluorescent 
genotype. We defined strong epistasis as |e] > 0.7, or as cases in which 
the observed fluorescence differed from the expected by at least five- 
fold, with a false discovery rate of <1% (Supplementary Fig. 4). 
Negative epistasis affected up to 30% of all genotypes, depending on 
the number of mutations (Fig. 3b, c), which resulted in a larger than 
expected fraction of non-fluorescent genotypes (Fig. 3c). Genotypes 


c 


mutations (white). b, c, Single missense mutations strongly decreasing 
fluorescence (violet) tended to occur at sites with internally oriented 
residues (b), shown on a selected 8-strand of the GFP structure (c). 
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Figure 3 | Prevalence of epistasis in the local fitness landscape of avGFP. 
a, A hypothetical representation of negative and positive epistasis as a 
function of the number of single mutations from avGFP. WT, wild type. 

b, The fraction of observed non-fluorescent genotypes (red) and the 


carrying more than seven mutations showed a decrease in the preva- 
lence of negative epistasis because many genotypes carrying multiple 
mutations were expected to lose fluorescence even without epistasis 
(Fig. 3b). Positive epistasis was rare in avGFP, on the order of accu- 
racy of our method. We sampled ~2% of all possible pairs of muta- 
tions (Extended Data Table 1), assaying 30% of pairs of amino acid 
sites (16,898 out of 55,696, Extended Data Fig. 4a). Epistatic pairs 
of sites were located across the avGFP sequence (Extended Data 
Fig. 4a), mostly beyond the range of direct physical interaction of amino 
acid residues (Extended Data Fig. 4b) but marginally closer together 
than random (Extended Data Fig. 4c, P< 0.0004, Mann-Whitney 
U-test). Epistasis was found among 96% of mutations with weak effect 
(Extended Data Fig. 4d), suggesting that their joint effect brings the 
protein over some stability margin®”°. Finally, epistasis was more 
common between pairs of sites in which both residues are internally 
oriented (Extended Data Fig. 4e). Taken together, these data indicate 
that epistasis was more common at functionally important sites. 

In a unidimensional landscape, fitness is a monotonic function of 
an intermediate variable known as fitness potential”!””, which is the 
sum of effects of individual mutations. We used multiple regression 
considering a non-epistatic fitness function in which log-fluorescence, 
F, is equal to the linear predictor, the fitness potential, p, such that 
F=f(p) =p. This simplest, non-epistatic model explained only 70% of 
the initial sample variance (0? = 1.12 and o* = 0.34 before and after the 
application of the model, respectively). Using the variance of the 2,442 
wild-type fluorescence measurements, we estimated that ~1% of the 
initial sample variance can be attributed to noise (0? = 0.0097), indi- 
cating that the remaining 29% of sample variance cannot be explained 
without epistasis. 

The simplest form of an epistatic fitness function is when fitness is 
a monotonic nonlinear function of p*’”. The lack of genotypes with 
intermediate fluorescence (Extended Data Fig. 5a) suggests that the 
avGFP fitneses landscape can be described by a truncation-like fitness 
function”’. We therefore modelled F as a sigmoid function of p, which 
explained 85% of the initial sample variance (0? = 0.17). A more com- 
plex sigmoid-shaped fitness function refined with a neural network 
approach (Supplementary Information 4.6) explained 93.5% of the ini- 
tial sample variance (o” = 0.065, Extended Data Fig. 5), confirming that 
the fitness landscape can mostly be represented by a unidimensional 
threshold function (Fig. 4), which can arise from the joint contribution 
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expected fraction of non-fluorescent genotypes calculated as the 

sum of the log-fluorescence effects of individual mutations (blue). 

c, The distributions of epistasis for negative and positive epistasis of 
different strength, with the expected false discovery rate shown in grey. 


of mutations to protein stability®!*'*?°*, The average fluorescence 
of single mutants of avGFP as a function of the predicted protein 
destabilization, AAG, reveals a threshold around 7-9 kcal mol™! 
(Fig. 4). Notably, the hidden value found by the artificial neural net- 
work for single mutants correlated to the predicted AAG (Fig. 4 and 
Extended Data Fig. 5f), confirming a probable influence of protein 
stability on the nature of epistasis in avGFP. The threshold fitness func- 
tion does a remarkably good job in approximating the entire fitness 
landscape, explaining ~95% of all variance. However, when taking into 
account the error rate of our data set, we estimate that at least 0.3% 
of genotypes cannot be explained by the threshold fitness function 
(Supplementary Information 4.5 and Extended Data Fig. 5d), repre- 
senting instances of multidimensional epistasis””. 

We compared the local avGFP fitness peak to the global GFP fitness 
landscape using sequences of GFP orthologues. Negative, threshold- 
like epistasis, leading to truncation selection*’ against slightly 
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Figure 4 | Modelling genotype to phenotype relationship. Median 
fluorescence of GFP with single mutations as a function of their effect 
on predicted folding energy (AAG), overlaid with the independently 
obtained sigmoid-like fitness function predicted by the neural network 
(orange line). Error bars denote s.d. 
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Figure 5 | The fitness matrix model of GFP long-term evolution. 

The normalized rate of convergent evolution to terminal and reconstructed 
ancestral amino acid states for each distance bin (grey dots). The expected 
(orange line) and observed in experimental data (orange dots) probability 
that a single mutation remains fluorescent as the sequence accumulates 


deleterious mutations may prevent their accumulation in evolution”””®. 


Thus, we compared the fraction of neutral single mutations to 
the rate of nonsynonymous and synonymous evolution (dN/dS), 
a proxy for the average strength of selection. The average dN/dS 
across a broad phylogenetic range was 0.35 + 0.1 (mean +s.d.), and 
0.17 when avGFP was compared to the orthologue from the closest 
fluorescent relative Aequorea macrodactyla. These measurements are 
similar to the estimated proportion of neutral mutations in avGFP 
(0.23), suggesting that the proportion of neutral mutations is similar 
across distant fitness peaks. The rate at which the phenotypic effect of 
amino acid substitutions changes across evolution, which is reflected 
in the changing rate of convergent evolution across the phylogenetic 
tree’”®, can be used to model the prevalence of epistasis with the 
fitness matrix model*®. This model approximates the prevalence of 
epistasis as the proportion of amino acid mutations that markedly 
change their effect on fitness after the occurrence of substitutions 
at other sites (Supplementary Information 5). Applying the fitness 
matrix model to the GFP multiple alignment, we predicted the pro- 
portion of mutations that change their effects on fluorescence when 
found in a different genetic background, revealing prevalence of 
positive and negative epistasis, which concur with our experimen- 
tal observations (Fig. 5 and Supplementary Fig. 5). The congruence 
of the data from an evolutionary trajectory spanning hundreds of 
millions of years with experimental data are remarkable, suggesting 
similarity in the local and global structures of the fitness landscape 
shaped by strong epistatic interactions. 

Our study provides complementary results of the analysis of single 
and double mutations to several previous studies”!'~'*, and a novel 
depiction of a large segment of the fitness landscape of a single protein. 
The proportion of neutral single mutations in our data was similar to 
that observed when fitness was assayed directly or through competi- 
tion experiments!?**”? but substantially lower than that observed in 
functional studies*!°!"!*'7, Furthermore, the propensity of multiple 
mutations to have a stronger negative effect on fitness than the sum 
of individual mutation effects has been observed”!®'*!*, However, 
because our analysis considered genotypes carrying multiple mutations, 
we infer a wider picture of the local fitness landscape (Supplementary 
Video 1). The avGFP fitness peak is narrow and defined by negative 
epistasis, best described by truncation selection in which fluorescence 
is eliminated if the joint effect of mutations exceeds a threshold of 
an intermediate property, possibly protein stability®”°. Such a land- 
scape increases the efficacy of selection against slightly deleterious 
mutations”’, preventing their accumulation in evolution*”®. 
Simultaneously, the fitness landscape is approximately non-epistatic 
near the fitness peak. If other proteins have a similar fitness landscape 
it would support the nearly neutral theory of evolution*®, explaining 
the selective forces and evolutionary dynamics of mutations with 
negligible individual effects on fitness. 
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other substitutions. The expected (green line) and observed (green dots) 
probability that a non-fluorescent mutation becomes fluorescent with 
sequence divergence. Bars represent a binomial proportion confidence 
interval (confidence level 68%). 


The broad congruence of our data with the prevalence of epistasis 
from long-term evolution suggests that the shape of the local fitness 
landscape can be extrapolated to a larger scale. Yet, epistasis between 
sites coding for residues with a direct interaction in protein structure 
was rare, contrasting with observation of such instances in long-term 
evolution’® and a mutation assay of the RNA recognition motif (RRM) 
domain’’. Thus, the local fitness landscape spanning a few mutations 
from a single fitness peak may be approximated by a unidimensional 
threshold fitness potential function; however, this simple fitness 
function may not be appropriate to describe fitness landscapes that 
incorporate fitness ridges connecting sequences of more divergent 
orthologues”’. The nature of global fitness landscapes, especially the 
interaction between local and global scales, remains to be explored. 
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Extended Data Figure 1 | Scheme of the experimental approach. The depiction of the construct design, expression and cell sorting. 
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Extended Data Figure 2 | Fluorescence and impact of mutations. A violin plot of the measured levels of fluorescence for genotypes carrying different 


numbers of missense mutations. 
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Extended Data Figure 3 | Mutant genotypes and evolution. a, b, The log-fluorescence and evolutionary conservation expressed as Shannon entropy (a), 
and fraction of mutant amino acid states found in avGFP orthologues (b). The y-axis error bars in b show the binomial proportion confidence interval 
level (68%), and other error bars denote s.e.m. 
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Extended Data Figure 5 | Modelling effect of mutations on fluorescence. 
a, A multiple linear regression in which fluorescence is linear combination 
of effects of individual single mutations. b, A multiple regression in which 
mutations contribute linearly to a fitness potential and fluorescence is 

a sigmoidal function of p where Fee ?/(1+e ?). c, d, The predicted 
fluorescence by a neural network approach. Predicted fitness function by 
a neural network with one hidden neuron and two neurons in the outer 
layer. e, The scheme of our neural network approach. The genotype data 
was passed to the input layer of the neural network as an array of 0s or Is 
corresponding to the absence or presence of amino acid mutations in the 
genotype, respectively. The first hidden layer consisted of a single neuron 
that calculated the weighted sum of inputs using weights obtained during 


1.5 2.0 2.5 


training. The output of the first hidden layer was passed through an 
output subnetwork that transformed this value with a nonlinear function 
to make the final prediction of fluorescence. The output subnetwork 
consisted of several neurons with a sigmoidal transfer function, allowing 
the subnetwork to approximate a broad range of nonlinear functions. The 
final mapping of the hidden value to fluorescence was determined by the 
weights of connections between neurons inside the output subnetwork. 
During training all weights were optimized to find the best prediction 

of fluorescence from the hidden value. The resulting function that was 
defined during training is shown in Fig. 4. f, Correlation between the 
hidden value of the neural network and Rosetta-predicted AAG for single 
mutants. 
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Extended Data Table 1 | Genotypes with measured fluorescence in our data set 


Number of Number of Number of Fraction of Number of Number of Fraction of 
mutations amino acid possible amino acid nucleotide possible nucleotide 
from the sequences amino acid sequences sequences nucleotide sequences 
wild type assayed sequences* sampled assayed sequences sampled 


Te es em 
Se 


7,215 24x10" 4040" 8,164 3.63 x 10" 2.25x10"" 
pe 4,643 48x10" 96x10°° 5,869 1.28 x 10” 4.58 x10" 
2,783 8.5x 10" 33x10" 3,664 3:67 x10" 9.47x 10" 
po | 1,526 13x10" 12o%40" 2915 1.02 x 107? 247x100" 
a 1.8 x 107 44 x4107 1,229 359x410" 513K 10 


*We considered possible amino acid sequences as sequences in which every amino acid sequence is the result of single nucleotide substitutions from the wild-type avGFP sequence. 
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Opponent and bidirectional control of movement 
velocity in the basal ganglia 


Eric A. Yttri! & Joshua T. Dudman! 


For goal-directed behaviour it is critical that we can both select the 
appropriate action and learn to modify the underlying movements 
(for example, the pitch of a note or velocity of a reach) to improve 
outcomes. The basal ganglia are a critical nexus where circuits 
necessary for the production of behaviour, such as the neocortex 
and thalamus, are integrated with reward signalling’ to reinforce 
successful, purposive actions”. The dorsal striatum, a major input 
structure of basal ganglia, is composed of two opponent pathways, 
direct and indirect, thought to select actions that elicit positive 
outcomes and suppress actions that do not, respectively**. Activity- 
dependent plasticity modulated by reward is thought to be sufficient 
for selecting actions in the striatum. Although perturbations of 
basal ganglia function produce profound changes in movement’, 
it remains unknown whether activity-dependent plasticity is 
sufficient to produce learned changes in movement kinematics, 
such as velocity. Here we use cell-type-specific stimulation in mice 
delivered in closed loop during movement to demonstrate that 
activity in either the direct or indirect pathway is sufficient to 
produce specific and sustained increases or decreases in velocity, 
without affecting action selection or motivation. These behavioural 
changes were a form of learning that accumulated over trials, 
persisted after the cessation of stimulation, and were abolished 
in the presence of dopamine antagonists. Our results reveal that 
the direct and indirect pathways can each bidirectionally control 
movement velocity, demonstrating unprecedented specificity and 
flexibility in the control of volition by the basal ganglia. 

Purposive action requires selection of a goal (for example, go left) 
and execution parameters (for example, how fast to go). For example, in 
bird song selection of both discrete, sequential actions (syllables) as well 
as the pitch can be controlled by reinforcement in cortico-basal ganglia 
pathways®”. The striatum is a major input nucleus in basal ganglia’ 
and the direct and indirect pathway are primarily composed of two 
molecularly distinct'° populations of medium spiny projection neurons 
(MSNs): direct striatonigral (AMSN) and indirect striatopallidal 
(iMSN) neurons. Sustained activation of dMSNs increases movement, 
whereas sustained activation of iMSNs reduces movement". Asa result, 
the balance of activity-dependent plasticity at cortical synapses onto 
dMSNs and iMSNs is thought to underlie the selection of successful 
goal-directed actions*’'”. While it is known that stimulation of direct 
pathway neurons can support self-stimulation’’ and bias concomitant 
choice behaviour", there is little direct evidence that MSN activity is 
sufficient to produce persistent, specific changes in subsequent actions. 

We trained mice expressing channelrhodopsin-2 (ChR2) in either 
dMSNs or iMSNs to perform self-paced, bimanual forelimb movements 
while head-fixed to obtain a water reward (Fig. 1a; Supplementary 
Videos 1-4). These single, discrete movements provided a reliable, 
repeatable behaviour from which we could extract movement para- 
meters (Fig. 1b-d). To determine whether activity in MSNs during 
a voluntary action is sufficient to control movement parameters, we 
administered closed-loop photostimulation to the dorsomedial stri- 
atum during the fastest third of movements. Stimulation intensity 


was adjusted to be subthreshold for direct effects on movement, but 
sufficient to modulate activity to a similar magnitude as endogenous 
modulation of striatal activity during limb movements (Fig. le, f and 
Extended Data Fig. 1). Stimulation onset occurred within 15 ms of the 
beginning of a movement and persisted for 450 ms (comparable to 
movement duration; 505 ms; Fig. 1c, d). To maintain motivation to per- 
form the task independent of stimulation, all movements that crossed 
the criterion amplitude threshold elicited a delayed liquid reward. 

We first asked whether photostimulation of dMSNs during the fastest 
third of movements could alter the velocity of subsequent movements. 
Indeed, brief dMSN stimulation was sufficient to produce a signifi- 
cant increase in the peak velocity (1.4cms™' increase from 29.7 cms~!; 
P<7x 10°; Fig. 2 and Extended Data Fig. 2) of all limb movements. 
Other movement parameters that were not targeted for closed-loop 
stimulation such as the amplitude, duration, and tortuosity remained 
unaltered (P > 0.7). This is despite the fact that mice were capable of 
rapidly adjusting movement parameters to changing reward contin- 
gencies (Extended Data Fig. 3). By contrast, iMSNs stimulation during 
the fastest third of limb movements produced a significant reduction 
in peak velocity (—1.1cms~'; P<7 x 10~“). The effect of iMSN stim- 
ulation had its maximal effect on velocity; movement duration and 
tortuosity were not significantly altered (P> 0.3). Prolonged tonic acti- 
vation of dMSNs tends to be pro-kinetic in that it evokes generalized 
increases in voluntary movement (‘response vigor’), whereas tonic 
activation of iMSNs tends to decrease voluntary movement''. However, 
we found that neither brief dMSN nor iMSN stimulation during the 
fastest movements produced a change in the rate of trial initiation or the 
rate of licking during reward anticipation and consumption (Fig. 2b and 
Extended Data Table 1). These results thus demonstrate that closed- 
loop activation of MSNs is sufficient to produce sustained changes in 
movement parameters without generalized changes in movement or 
motivation. 

We next examined the effect of successive stimulation on limb move- 
ment velocity. If stimulation merely altered the velocity of the current 
movement, then repeated stimulation should produce an immediate, 
but constant effect. However, stimulation drove a steady change in 
velocity that accumulated over the course of several trials (Fig. 2d), 
apparent in individual sessions (Fig. 2a and Extended Data Fig. 2). 
We also found that unstimulated movements (trials with subthreshold 
velocity) were changed to a similar extent. dMSN stimulation produced 
a 0.9cms_‘ increase (P=0.014) in velocity on unstimulated move- 
ments whereas iMSN stimulation produced a —1.0cms' decrease 
(P=0.001) in the velocity of unstimulated movements. Moreover, there 
was no change in variance of the distribution of velocities through- 
out the session (F-test, P> 0.5 for both groups, Extended Data Fig. 4). 
Together these observations argue that selective stimulation produced 
a gradual, accumulating shift in the entire distribution of velocities, 
rather than a change restricted to the stimulated subset (for example, 
making only fast, stimulated limb movements even faster). These 
cumulative changes in behaviour may be contrasted with previous 
reports of optogenetic stimulation that have observed transient effects 
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Figure 1 | Paradigm for closed-loop stimulation in dorsomedial 
striatum. a, Mice were head-fixed in front of a side-mounted joystick 
and a water port. Optical fibres were chronically implanted. Tips were 
positioned in the dorsomedial striatum and coupled to a 473 nm laser. 
Insert shows fibre position; D, dorsal; V, ventral. Fluorescent image is from 
iMSN neurons expressing ChR2-YFP. b, To receive liquid reward, mice 
made forelimb movements with the joystick (either a pull or push) past the 
criterion distance. Reward delivered 1s after threshold crossing. Inter-trial 
intervals (ITIs) were 3s (uncued). c, Instantaneous velocity and position 


confined to the stimulated trial!*»'* or concomitant with stimulus 
delivery’. 

If stimulation of the fastest movements produces a persistent change 
in the selection of movement parameters, the change should persist 
without stimulation. We plotted the velocity of movements made dur- 
ing the block of trials immediately following the stimulation block. 
In this recovery block, no stimulation was delivered. We found that 
stimulation-induced changes in the distribution of velocities persisted 
for tens of trials before gradually returning to the pre-stimulation 
baseline during the recovery block (Fig. 2a, d; paired t-test, P=0.64 
and 0.90, dMSN and iMSN, respectively). Importantly, this return to 
the pre-stimulation distribution had a similar time course whether it 
required a decrease or increase in the mean velocity following dMSN 
or iMSN stimulation, respectively. 

We have shown that dMSN and iMSN have opponent roles in the 
reinforcement of movement parameters with unprecedented specificity. 
Stimulation induced changes are signed—dMSN stimulation increases 
a kinematic parameter of movement (velocity) whereas iMSN stimu- 
lation decreases the same property. However, there is a limitation to 
this simple opponency for learning: reinforcement should, in principle, 
alter behaviour so as to increase a reinforcing outcome regardless of the 
sign of the behavioural change’®. It should be possible, for example, to 
learn to move more slowly to obtain more reward. Our data are also 
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of joystick for seven trials (green triangle indicates trial start). Velocity 
threshold for closed-loop optical stimulation and time of stimulation 
onset indicated by the blue dashed line and diamonds, respectively. Yellow 
squares indicate reward. d, Histograms of movement amplitude, peak 
velocity, and duration for all eight mice (45 sham sessions). e, Average 
response (z-scored change from baseline firing rate) of striatal units 
aligned to movement onset from a single session. Population average 
shown above. f, Raster plot of population activity during photostimulation 
from a single session. 


consistent with an alternative possibility: dMSN stimulation may be 
sufficient to drive changes towards movements that elicit stimulation 
independent of the sign (positive/negative) of the change. To distin- 
guish between these alternatives, we stimulated MSNs during the slow- 
est, rather than the fastest, third of limb movements. This stimulation 
protocol produced the opposite effects for both dMSN and iMSN stim- 
ulation (Fig. 2e, f). Under these conditions, stimulation of dMSN was 
sufficient to produce a cumulative decrease in velocity (—1.1cms~’, 
P=0.008). Conversely, iMSN stimulation produced an accumulating 
increase in velocity (0.9cms~', P=0.012). Thus, the direct and indirect 
pathways of the basal ganglia are opponent pathways that are also suffi- 
cient for bidirectional changes in a continuous parameter that specifies 
purposive movement. 

Models of the basal ganglia in which reinforcement learning acts to 
select amongst mutually exclusive actions can explain a broad array 
of empirical results in the learning literature'*. However, such models 
cannot readily account for reinforcement acting on a continuous para- 
meter of movement such as velocity!” (see Supplementary Discussion). 
By contrast, a learning rule in which closed-loop stimulation provides 
a pathway-specific, signed learning signal that determines the mean 
of the velocity distribution could reproduce our data (Fig. 3a and 
Methods). Owing to the bidirectional behavioural changes observed, 
this learning rule makes a specific prediction: stimulation on every trial 
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Figure 2 | Closed-loop stimulation produces opponent, bidirectional 
control of movement velocity. a, Difference in peak velocity between 
stimulation and sham session (Avelocity) for sessions in which (MSN 
(upper, blue throughout) or iMSN (lower, red throughout) were stimulated 
on the fastest third of 50 trials during stimulation and no stimulus was 
delivered during recovery. Example session shown. b, Histograms of 
inter-movement-interval (left) and lick rate during reward consumption 
(right) for sham (black; 25 sessions in 4 dMSN mice, 20 sessions in 4 iMSN 
mice) and stimulation (coloured; 22 sessions in dMSN mice, 26 sessions 

in iMSN mice) sessions. c, Population average of change in movement 
parameters when fastest third of reaches were stimulated. d, Population 
average Avelocity as a function of movement (trial) number when fastest 
third of reaches were stimulated. e, f, Same as c, d, but for sessions in 
which stimulation occurred on the slowest third of movements. *P < 0.05; 
** P< 0.005, two tailed t-test. Shaded area indicates standard error of the 
mean. Data are from 16 stimulation and 18 sham sessions in the same 

4 dMSN mice, 20 stimulation, 16 sham sessions in the same 4 iMSN mice. 


or at random throughout a session should produce no net change in 
velocity. Consistent with this prediction, each simulated stimulation 
protocol failed to produce a detectable change in movement velocity 
(P > 0.2 for all conditions, Fig. 3 and Extended Data Fig. 5). 

As formulated, this learning rule would induce a persistent change 
in velocity following stimulation. Extinction formulated as a fixed 
decay in synaptic weight'* would not produce symmetric recovery as 
observed (Fig. 2 and Supplementary Discussion). To account for this 
feature of the data, we assumed a homeostatic component and refer 
to the rule as ‘mean shift with homeostasis’ (MeSH). Thus, the mean 
velocity of movement is determined by a set point that opposes learned 
changes and restores velocity towards baseline during recovery. When 
incorporated into the learning rule, we found that simulations closely 
reproduced the data during stimulation and recovery epochs. Selective 
stimulation that biased the reward-based feedback steadily drove veloc- 
ity towards (dMSN) or away (iMSN) from the threshold that elicited 
stimulation (Fig. 3a). Upon cessation of stimulation, the homeostatic 
component drove recovery to the pre-stimulation baseline within 
50 trials (Fig. 3). 

MeSH assumes an explicit interaction between reward signalling, 
putatively carried by midbrain dopaminergic inputs to the dorsal 
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striatum®, and exogenous activation of MSNs. In contrast to this pre- 
diction, previous work has suggested that intracranial self-stimulation 
supported by striatal stimulation is independent of dopamine recep- 
tor activation’. However, the movement-related activity of striatal 
populations and our use of brief 450 ms stimulation both differ from 
the sustained, post-movement stimulation used previously. Thus, we 
next asked if dopamine was necessary for stimulation to elicit changes 
in movement velocity. We found that a low concentration of D1 and 
D2 receptor antagonists (SCH23390, 0.02 mgkg™! and sulpiride, 
25 mg kg~') injected before a behavioural session’? eliminated per- 
sistent changes in velocity following closed-loop stimulation of either 
dMSN or iMSN (Fig. 3b, d) while largely sparing normal task perfor- 
mance (Fig. 3c, all dMSN parameters P > 0.2; all iMSN parameters 
P>0.3). Dopamine antagonists significantly reduced the magnitude 
of the stimulation effect for both dMSN stimulation (92% decrease; 
1.3cms !, P<1x 1078) and iMSN stimulation (109% increase; 
L2cms",P=1.5%10-’). 

Our results suggest that stimulation engages a dopamine-dependent, 
bidirectional plasticity in striatal MSNs. While a learning rule that acts 
directly on a parameter specifying the velocity distribution can account 
for our behavioural results, MeSH is abstract and it is unclear how it 
could be implemented in corticostriatal circuits critical for goal-di- 
rected, instrumental behaviour!’. Thus, we implemented a simplified 
corticostriatal circuit model with the following key features (Extended 
Data Fig. 6). 

While an action-value formulation implies that movements of 
different velocities are represented as a set of distinguishable neural 
states, MeSH implies a continuous representation of speed. There is 
little empirical evidence for representation of specific velocity ranges 
in cortical activity'’. By contrast, there is substantial evidence that 
cortical’®”° and striatal?! representations of forelimb movements are 
monotonically tuned to speed. Consistent with the anatomy of the 
corticostriatal pathway’, we assume that the speed of a movement is 
determined by both cortical and basal ganglia output (Extended Data 
Fig. 6). In combination with monotonic tuning this implies that the 
mean movement velocity is proportional to the average weight of 
corticostriatal synapses (Methods). 

Dopamine and spike-timing dependent plasticity (STDP) has been 
described in the striatum””’. STDP can result in bidirectional plasticity 
with the balance of potentiation and depression adapted to the range 
of population activity (that is, BCM-type plasticity”) in the presence 
of variable spike trains”. We assume a balance of potentiation and 
depression such that movements made at the average speed produce 
no net change. We posit that photostimulation enhances both poten- 
tiation and depression consistent with our observation that selective 
stimulation produces biased changes whereas non-selective stimulation 
produces no net change (Figs 2 and 3; Supplementary Discussion). 
Balanced synaptic plasticity that is enhanced by stimulation is sufficient 
to produce bidirectional changes in the average corticostriatal synapse 
weight during selective photostimulation. When incorporated into a 
corticostriatal circuit model, this plasticity rule produces opponent, 
bidirectional, and symmetric changes in movement speed (Fig. 4a, b 
and Extended Data Fig. 6). 

Finally, we sought to validate our circuit model with electrophys- 
iological recordings from dorsomedial striatum. Individual units 
(putative MSNs) recorded from mice performing the task were mono- 
tonically tuned to movement velocity (Fig. 4c), confirming previous 
observations*’®. An important feature of our behavioural results was 
that closed-loop stimulation does not simply reinforce stimulated 
movements, but rather produces a change in the mean velocity. Thus, 
changes in striatal activity should be apparent even for unstimulated 
movements. Specifically, the slope relating firing rate to velocity should 
be changed (Fig. 4b). Moreover, if photostimulation is necessary to alter 
plasticity in the recorded neuron then slope changes should correlate 
with photostimulation (Fig. 4b). To test this prediction we analysed a 
population of striatal units (n = 35) during closed-loop stimulation of 
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Figure 3 | Changes in velocity are consistent with dopamine-dependent 
reinforcement learning. a, Simulation of MeSH learning rule (see text for 
details). Change in average peak velocity (arbitrary units) as a function of 
trial number for dMSN-stimulation (blue) and iMSN-stimulation (red) 
simulations. b, AVelocity as a function of trial for stimulation of (MSN 
(blue) and iMSN (red) on the fastest third of 50 stimulation trials in the 
presence of dopamine receptor antagonists. Data from 14 stimulation 

and 11 sham dMSN sessions; 8 stimulation and 9 iMSN sham sessions. 


dMSNs on the fastest third of movements. Consistent with the model 
predictions, we observed an increase in the slope of the velocity tuning 
associated with an increase in the average velocity (Fig. 4d). Changes 
in tuning tended to be most prominent on units responsive to 
photostimulation (putative dMSNs; Fig. 4d) whereas neurons with 
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Figure 4 | Corticostriatal circuit model implements MeSH rule and 
experimental validation. a, Change in movement velocity (arbitrary 
units, AU) as a function of number of simulated trials (shaded area 
indicates standard deviation; n = 100) for simulations in which (MSN 
(blue) or iMSN (red) were stimulated during fastest ~30% of reaches. 

b, Histogram of the change in the average slope between dMSN (blue) or 
iMSN (red) activity and movement velocity for simulations with dMSN 
stimulation. Triangles indicate mean. Circles indicate mean change in 
weight of corticostriatal synapse (see text for details). c, Mean firing 

rate during movement (0-505 ms after onset) is plotted as a function of 
peak movement velocity with overlaid linear fit (‘tuning slope’) for 

3 of 35 example striatal units. d, Stimulation response (per cent change 

in baseline firing rate during photostimulation) as a function of the change 
in the tuning slope (Aslope) for unstimulated trials (subthreshold velocity) 
of sessions in which dMSN were stimulated on the fastest third 

of movements. Linear fit overlaid (r= 0.47). 
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weak stimulation responses (putative iMSNs or distant dMSNs) tended 
to show decreases in apparent tuning slope (Fig. 4d). 

Here we provide the first demonstration, to our knowledge, that the 
innate biases in the direct and indirect pathway to increase or decrease 
the frequency of movement, respectively, do not extend to fixed biases 
in the control of movement parameters. The direct and indirect path- 
ways engage opponent, activity-dependent plasticity mechanisms 
that can produce sustained biases in future behaviour. Each pathway 
is sufficient to produce bidirectional changes and, to some extent, 
is innervated by distinct cortical populations”’, suggesting that 
bidirectional control by each pathway could allow for adaptive con- 
trol of goal-directed actions in different contexts or under different 
demands. These data argue that phasic activity in the striatum during 
specific movements is sufficient to selectively reinforce changes in a 
movement parameter independent of a generalized change in motiva- 
tion consistent with a role for dopamine-dependent signalling in dorsal 
striatum in the control of movement vigour”)”°8, 

Our results reveal a bidirectional control of behaviour by MSNs that 
may be contrasted with the recent observation that self-stimulation 
supported by MSNs is unidirectional and dopamine-independent’?. 
The differences between the findings may reflect the different experi- 
mental paradigms. Selectively biasing striatal activity in the context of 
a reward-based operant task could engage mechanisms distinct from 
the reinforcing properties of photostimulation itself. In the latter case 
strong stimulation may be sufficient to replace dopaminergic inputs 
or support self-stimulation in a dopamine-independent manner’’. In 
addition, we observed a symmetric recovery to baseline following ces- 
sation of stimulation that is also distinct from the differential extinc- 
tion of self-stimulation'’. However, a recent modelling study argued 
that apparent differences in extinction are consistent with equivalent 
learning rates following dMSN and iMSN stimulation”’ consistent with 
our observation of opponent, but symmetric effects. 

Here we proposed a circuit implementation by which a continuous 
parameter defining a purposive movement can be selectively reinforced 
by a stimulation-dependent enhancement of bidirectional synaptic 
plasticity. Importantly, it has been shown that striatal neurons are capa- 
ble of bidirectional synaptic plasticity***; however, plasticity is medi- 
ated by distinct signalling events in the two populations”’. Resolving 
the roles played by the intersection of these different cellular and circuit 
factors that govern bidirectional plasticity will be critical to understand 
the role of dopamine in instrumental learning. In addition to kine- 
matic parameters of movement, other aspects of reinforcement learning 
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are governed by continuous parameters such as rates*? or value®. The 
circuit implementation we propose could provide a general mechanism 
by which activity-dependent plasticity in striatum produces learned 
changes in continuous parameters with monotonic representations in 
neural activity. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Subjects. Experimental subjects were eight adult (over 2 months old) male mice, 
four each of Drdla-Cre (http://www.informatics.jax.org/allele/MGI:3836631) or 
Drd2-Cre (http://www.informatics.jax.org/allele/MGI:3836635) crossed with a 
mouse with an allele for Cre-dependent expression of channelrhodopsin-2 fused 
to enhanced yellow fluorescent protein (Ai32; https://www.jax.org/strain/012569). 
Mouse lines expressing Cre-recombinase were produced by the GENSAT 
project (GENSAT project, Rockefeller University, New York, USA)*!*? and obtained 
from the MMRC (https://www.mmrrc.org). Ai32 mice were obtained from 
Jackson Laboratory and produced by the Allen Institute for Brain Science (https:// 
www.alleninstitute.org)**. Number of animals and sessions were based upon 
previous studies using an intersession control model. Experimenters were not 
blinded to the condition or animal strain. All animals were handled in accordance 
with guidelines approved by the Institutional Animal Care and Use Committee 
(IACUC) of Janelia Research Campus which is AAALAC accredited. 

Animal care. Mice were individually housed in a temperature- and humidity- 
controlled room maintained on a reversed 12-h light/dark cycle. Following 1 week 
of recovery from surgery, the water consumption of the mice was limited to at least 
1 ml per day. Mice underwent daily health checks, and water restriction was eased 
if mice fell below 70% of their body weight at the beginning of deprivation. Mice 
were acclimated to head fixation and trained to lick drops of water sweetened 
with saccharin. 

Behavioural training. Mice spent 4-8 weeks adjusting to being head-fixed and 
learning to displace a side-mounted joystick, placed 2.7 cm away from their 
platform, to a threshold of roughly 0.5 cm. After this initial training, the thresh- 
old for a successful trial (‘criterion threshold’) was reduced 0.1 cm so that every 
joystick movement would be rewarded. Reaching movements were self-paced. Most 
movement amplitudes easily exceeded this amplitude threshold (Fig. 1c). 

Mice were restricted to consume 1.5 ml of water per day to maintain motivation 
for task completion. Movement was measured by recording voltage changes applied 
across the variable potentiometer connected to the joystick and were found to be 
linearly proportional to displacement over the range of movement amplitudes 
used by mice. At the start of each trial, joystick position was centred to coordinates 
(0,0), and animals were trained to manoeuvre the joystick to certain displacement 
thresholds equal to a set resistance change of a potentiometer. Both movements 
away from and towards the body were allowed. Delivery of a sweetened water 
reward (~0.05-0.1 ml per trial; controlled by an audible solenoid valve) signalled 
a successful movement and advancement to the next trial. Water delivery was 
delayed by 1,000 ms after the joystick position crossed a specific distance threshold. 
The threshold was set at an arbitrary, low value such that false positives were not 
detected, but all trial-initiating movements in well-trained mice were rewarded. 
A force of 0.1 N was required to displace and hold the joystick at an eccentric 
position. For reference, this is at least fivefold less than a mouse can pull towards 
itself using its forelimbs for several seconds**. 

No other task-related stimulus was present and the behaviour was performed in 

a darkened behaviour box. Rewards were followed by a 4,000 ms inter-trial interval 
(ITI) in which no movements would be rewarded. The joystick position at the end 
of the ITI (almost always near the central default position) was used as the initial 
position for the subsequent trial. Mice performed at least 125 trials per session. The 
initial 25 trials were only used for daily acclimation of the animal to the behavioural 
setup. Blocks of 50 trials were performed with the stimulation block followed by 
the no stimulation block. Many blocks could be completed, but only the first block 
from each condition were used in these analyses. Sham stimulation sessions were 
identical to stimulation sessions, including the attachment of the optic fibre to the 
mouse’ head, with the exception that the laser was not turned on. Stimulation 
sessions were conducted at random, and the data from the subsequent session 
were not used in analysis. 
Fibre implantation and optical stimulation. Implantation surgery was performed 
under full anaesthesia (1.5% isoflurane). The skull was exposed and fibre optic 
probes were unilaterally inserted 2.2 mm into the brain at 0.5 mm anterior and 
1.8mm lateral to bregma. Fibre optic probes were made of glass fibres (100 1m 
core) fitted with zirconia LC connectors. Head fixation caps were implanted at 
the end of the procedure and all elements and remaining skull were covered with 
dental acrylic as described previously*’. All surgical procedures were performed 
under aseptic conditions. 

Fibre implants were targeted to the dorsomedial aspect of the striatum (DMS)'. 
Extended Data Fig. 1a shows the localization of the tips of optical fibres implanted 
for dMSN and iMSN stimulation. To characterize this specific location in more 
detail we performed bilateral injections of a retrograde tracer (Lumiflor beads) into 
the approximate DMS location of fibre implants. We found extensive retrograde 
labelling of neocortical neurons over a relatively extended rostro-caudal axis that 
was biased towards the medial aspect of neocortex (Extended Data Fig. 1b, c), 
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consistent with previous results from our lab*°. Based upon the anatomical atlas of 
the mouse brain*” these cortical structures are annotated as M2 (secondary motor 
cortex) and Cg (cingulate cortex). However, we note that recent functional mapping 
of neocortex indicates that these sites are also within the boundaries of the rostral 
and caudal forelimb regions (Extended Data Fig. 1c)—areas that are sufficient to 
produce forelimb movements in response to microstimulation®®. 

Closed-loop photostimulation on high- and low-velocity blocks was accom- 
plished through online monitoring of instantaneous joystick velocity. Thresholds 
for triggering the laser were set for each animal such that approximately one-third 
of baseline movements would be suprathreshold. The velocity threshold within a 
session was fixed, but on occasion it was changed from one session to the next. 
Thresholds for all mice and all sessions were within 6% of each other. For stimula- 
tion of all movements, we set the velocity threshold low enough that all movements 
were suprathreshold. For low-velocity triggering (Fig. 3), we took advantage of 
the reliable, stereotyped nature of the reaches to predict peak velocity from early 
velocity. To trigger photostimulation, velocity needed to initially pass a low, ‘onset’ 
threshold while not exceeding a higher ‘too fast’ threshold for the next 20 ms. 
Using our real-time velocity triggering, we correctly stimulated 96% of upper-third 
(fast) reaches protocol and 84% of lower-third (slow) reaches protocol. Our false 
positive stimulation rate was 9% for both protocols. Photostimulation consisted 
10-ms pulses at 16.7 Hz for 450 ms from a 473 nm blue laser set so that the power 
at the tip of the implanted optic probe was 3-6 mW. This was at a frequency below 
that which individual neurons could reliably follow (Extended Data Fig. 7). Upper- 
third stimulation data consist of 22 stimulation and 25 sham sessions in dMSN 
mice, 26 stimulation and 20 sham sessions in iMSN mice. Lower-third stimulation 
data consist of 16 stimulation and 18 sham sessions in dMSN mice, 20 stimulation, 
16 sham sessions in iMSN mice. 

We next sought to estimate the extent of light spread based upon the laser 
power, fibre diameter and duty cycle of our pulse train using a combination of 
simulation’? and electrophysiology. The simulation result is shown in Extended 
Data Fig. 1d. In brief, the peak intensity of light stimulation was reduced to 1% 
maximum by ~1 mm below and 0.5 mm lateral to the optical fibre. To directly 
estimate the change in stimulation efficacy as a function of distance we performed 
recordings with a 4-shank silicon probe (NeuroNexusTech; ‘Buzsaki32’ site layout) 
on which 1 shank was affixed with an optical fibre. Consistent with the estimate 
of light scattering from the simulation we found that direct light activation was 
substantially weaker on the neighbouring shanks (Extended Data Fig. le). At the 
location of our fibres (Extended Data Fig. 1) the dorsal striatum extends for ~2 mm 
is all dimensions and the DMS roughly extends for 1 mm. Thus, these data indicate 
that direct photostimulation was restricted to the dorsal striatum. 

Behavioural analysis. All behavioural events were recorded on separate chan- 
nels at 1 kHz (BlackRock Microsystems; Salt Lake City, Utah). Data analysis 
was performed using written routines in MATLAB 2014 (MathWorks; Natick, 
Massachusetts) to extract individual forelimb movement trajectories (Extended 
Data Fig. 8). Quantification of individual movements considered only the outward 
component of the reach and quantified the peak amplitude and velocity. The begin- 
ning of the reach was assessed offline for each reach and was determined to be the 
first time point constituting the increasing velocity associated with that reach. The 
duration was computed as the full duration of the movement and tortuosity is a 
measure of the directness of the reach path, defined as the path length divided by 
the end point distance. Z scores were computed for each stimulation session, using 
the average sham session mean and standard deviation, within each animal, then 
combined across animals. Simulations of behavioural learning were implemented 
in MATLAB and are described in detail in the Extended Data. Unless otherwise 
noted, statistical significance refers to P< 0.05, two-tailed Student's t-test. 

Electrophysiology. Extracellular electrophysiology was performed in the dorsal 
striatum of awake, behaving mice. 32 channel silicon probe arrays with attached 
integrated optical fibres (NeuroNexus Tech; “Buzsaki32’ site arrangement) were 
acutely implanted in the dorsal striatum (centre of array was positioned 0.5mm 
anterior and 1.8mm lateral to bregma and —2.0 mm to —3.0 mm depth from 
surface). Electrodes were prepared for recording by reducing the site impedance 
below 750kOhm. Broadband continuous data (0.1 Hz—7.5 kHz) were recorded 
with simultaneous sampling of voltage from the joystick, the lick port, and digital 
signals from the behaviour control system (30 kHz sample rate on all channels, 
Blackrock Microsystems, Salt Lake City, Utah). Continuous voltage signals were 
highpass filtered (0.5-7 kHz) offline and events that exceeded four times the stand- 
ard deviation of the continuous voltage signal were extracted (spikes). Spike sorting 
into individual units was performed in MATLAB using custom-written software. 
Spikes were isolated according to waveform amplitude distribution and principal 
components of the amplitude array across the eight electrodes (~25 1m spacing) 
of each shank (n = 8) of the silicon probe array. The event times for each individual 
single unit were then aligned to movement start as extracted from the continuous 
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voltage signal from the joystick. Velocity-firing-rate slopes were computed using 
the mean activity of each unit over the epoch spanning 0 to 400 ms after reach 
initiation. The evoked response for each stimulus (Fig. 1f) was averaged across all 
isolated neurons, with spikes placed into 2-ms-wide bins. 

Pharmacology. We injected D1 and D2 receptor antagonists (SCH23390, 
0.02 mgkg! and sulpiride, 25 mgkg“', co-injected intraperitoneally) before stim- 
ulation sessions. Sham sessions were ones in which the same animals received 0.9% 
saline injection instead of drug. 

The MeSH learning rule. To determine whether the changes in reach velocity due 
to stimulation were consistent with a reinforcement learning rule, we developed a 
simple computational model: 


Mi+1= Mit wy (mj — Mj) + wsS — wp (mi — P)/|m; — P| 


where M is the Gaussian distribution of values m for a given movement parameter, 
from which m; is chosen at random on trial i. Performance reinforcement in the 
form of reward, r, shifts the mean of M relative the reward r, to which is always 
given and therefore will be present in every trial. Additional stimulation-induced 
reinforcement also occurs, shifting the mean according to the type of stimulation, 
S (+1 or —1 for dMSN or iMSN, respectively; 0 for no stimulation) at a fixed 
proportion of the reward rate, that is, w.=C,w,. Finally, we have added a restora- 
tive set point, P, which is based upon the original mean of the distribution. 
Corticostriatal circuit model. We implemented a simple simulation of 500 D1 
and 500 D2 striatal units. Activity was continuously varied between 0 and 1. The 
activity of a given unit was defined as: r;= w; x active, where active ¢[0,1]. 

Movement velocity was a product of the total cortical output summed with the 
net contribution of striatal activity (see schematic in Extended Data Fig. 6). This 
is an explicit model of the structure of the corticostriatal projection where striatal 
neurons receive collateral input from corticocortical and coorticofugal outputs 
from neocortex!. Thus, movement velocity was: mvmt_velocity = a~active + 
(>2d1_msn — y>/d2_msn. Simulations were conducted with a variety of weight- 
ings, but for examples in the manuscript we used a=0.5, 3=1, y=1. 

Synapse weights were updated incrementally according to a simple update 
equation: w;(t+ 1) = w;(t) + active X Qjearn — (1 — active) Giearn- 

Thus, if the unit was active its weight was increased by Qiearn or otherwise 
decreased by (earn if inactive. Various parameterizations of learning rates could 
be used, but we typically used Qjearn = — Stearn = 0.05 in the absence of photostim- 
ulation and Qjearn = — learn = 0.09 during photostimulation. Altered learning rates 
were only applied to the stimulated population. 

In vitro intracellular recordings. Methods for Extended Data Fig. 7 were as 
described previously””. Briefly, for the preparation of in vitro brain slices, mice 
were deeply anaesthetized with isoflurane, decapitated, and the brain placed 
into ice-cold modified artificial cerebral spinal fluid (aCSF) (in mM: 52.5 NaCl, 


100 sucrose, 26 NaHCOs, 25 glucose, 2.5 KCl, 1.25 NaH2PO,, 1 CaCl, 5 MgCl, 
and in .M: 100 kynurenic acid) that had been saturated with 95%O02/5%CO3. 
300-um thick coronal slices were cut (Leica VT1200S; Leica Microsystems, 
Germany), transferred to a holding chamber and incubated at 35°C for 30 min in 
modified aCSF (in mM: 119 NaCl, 25 NaHCOs, 28 glucose, 2.5 KCl, 1.25 NaH2PO,, 
1.4 CaCl,, 1 MgCh, 3 Na pyruvate and in j1M: 400 ascorbate and 100 kynurenic 
acid, saturated with 95%O./5%CO,) and then stored at room temperature. 

For recordings, slices were transferred to a recordings chamber and superfused 
with modified aCSF (in mM: 119 NaCl, 25 NaHCOs;, 18 glucose, 2.5 KCl, 1.25 
NaH>PO,, 1.4 CaCl, 1 MgCl, 3 Na pyruvate and in .M: 400 ascorbate and sat- 
urated with 95%O,/5%CO2) maintained at 32-34 °C, at a flow rate of 2-3 ml per 
minute. Patch pipettes (resistance 5-8 MQ?) were pulled on a laser micropipette 
puller (Model P-2000, Sutter Instrument Co., Sunnyvale, California) and filled with 
a KGluconate-based intracellular solution (in mM: 137.5 KGluconate, 2.5 KCl, 10 
HEPES, 4 NaCl, 3 GTP, 40 ATP, 10 phosphocreatine, pH 7.5). Intracellular record- 
ings were made using a MultiClamp700B amplifier (Molecular Devices, Sunnyvale, 
California) interfaced to a computer using an analogue to digital converter (PCI- 
6259; National Instruments, Austin, Texas) controlled by custom written scripts 
in Igor Pro (Wavemetrics, Eugene, Oregon). Software is available at http://www. 
dudmanlab.org. 
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Extended Data Figure 1 | Anatomical localization of stimulated 
neurons and their corticostriatal inputs. a, Schematic showing the 
histological reconstruction of optic fibre endpoints in dMSN (cyan) and 
iMSN (red) mice. Hemispheric segregation for display purposes only; 
actual implants were randomized left and right. b, Location of retrobead 
tracer injection just below the coordinates of fibre implant terminus. 

c, Location of the labelled projection neurons innervating the retrobead 
injection site corresponding to the rostral forelimb area (RFA, orange) 
and caudal forelimb area (CFA, yellow). d, Results of simulation of 

light intensity and spread through the brain based upon the model of 
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Stujenske et al.°°. Light intensity drops below 10% peak intensity 285 .m 
below the fibre. At this depth the lateral fall-off is a drop to 1% peak 
intensity 250 1m. Contour lines for 10% intensity and 1% intensity are 
overlaid in white and grey, respectively. e, Average photostimulation- 
evoked activity of individual neurons (each row= 1 neuron) sorted 

by electrode shank. We observed minimal fidelity in the light-evoked 
responses of neurons on the farthest shank consistent with the predicted 
fall-off in light intensity from simulations. Blue dashes at top identify 
timing of laser pulses. Robust entrainment of spiking to photostimulation 
was also observed in vitro (Extended Data Fig. 7). 
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Extended Data Figure 2 | Selective stimulation of MSNs produces 
changes in peak velocity. a, Density plot of movement trajectories 
showing the percentage of movements that passed through a given 
amplitude throughout time, aligned to movement onset. Particularly 

in the magnified plot on the right, we can see that very few movement 
trajectories passed near the central 0.2 cm within the first 100 ms. 

b, Peak velocity distributions for sham and stimulation data sets for 
dMSN stimulation (blue, left column), and iMSN stimulation (red, middle 
column). Top row shows complete distributions across animals. Note that 
no particular part of the distribution is pronounced following stimulation. 
Bottom row shows means for all experimental sessions by animal. Animal 
number is for indexing purposes only. Most experiments were carried 


dMSN lag iMSN lag 


out concurrently across animals. The right column shows the contrast 
ratio (difference divided by sum) for dMSN and iMSN stimulation effects 
(blue and red, respectively). These data show a steady mean shift in the 
data; for instance, iMSN stimulation (red) is positive for velocities slower 
than the sham (control) mean (an increase in frequency), and is negative 
for velocity values greater than the sham mean (a decrease in frequency). 
We have curtailed the contrast ratio plot where too few values existed to 
achieve reliable estimates (55cms~'). c, Autocorrelation of movement 
velocities for dMSN stim (blue), iMSN (red), and sham (black) data. 

d, e, Same analyses as b, c, but for those sessions in which stimulation 
occurred on the slower, lower third of movement velocities. 
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Extended Data Figure 3 | Trained animals can adjust amplitude to the jump in amplitude threshold. c, Distribution of reach amplitudes 
changing task requirements. a, Reach amplitudes from a sample of seven across sessions for pre- (green) and post- (black) amplitude threshold 
sessions in two mice in which the eccentricity of the threshold to receive jump. These data indicate that the mean amplitude was not saturated 
reward was suddenly increased at random. Green field identifies reaches and suggest that behaviour remains outcome-dependent (that is, 


performed with the increased amplitude threshold. Shaded area represents goal-directed). 
standard error of the mean. b, Success rate before (black) and after (green) 
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Extended Data Figure 4 | Variance of movement velocity does not change throughout a session. Each session was z-scored and the standard deviation 
for each movement number is plotted for dMSN (cyan) and iMSN (red) stimulation. 
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Extended Data Figure 5 | Non-selective stimulation does not affect 
motor control or initiation. a, b, ‘All-stim’ (a; 15 stimulation and 17 
sham sessions from 4 dMSN mice; 17 stimulation and 20 sham sessions 
from 4iMSN mice) and ‘random stim (b; 11 stimulation and 15 sham 
dMSN sessions from 3 dMSN mice, 8 stimulation and 12 sham sessions 
from 3 iMSN mice) summary data showing (top) the mean velocity 

as a function of movement number for dMSN (cyan) and iMSN (red) 
stimulation sessions and (bottom) histograms of the inter-move interval 
(IMI; left) and lick rate (LR; right). Shaded area indicates standard error 


of the mean. We found no differences between sham (black lines) and 
stimulation (coloured lines) sessions for either dMSN stimulation (cyan) 
or iMSN stimulation (red). c, Plot of average trajectory position aligned to 
stimulation onset for random dMSN (top) and iMSN stimulation (middle) 
for stimulation (coloured) and sham (black) sessions (in sham sessions, 
timing was randomly chosen, but no stimulation was given). Stimulation 
did not systematically induce forelimb movement. For reference, the 
bottom trace displays closed-loop dMSNs aligned to stimulation onset. 
Grey field represents the 450 ms stimulation period. 
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Extended Data Figure 6 | A corticostriatal circuit model that 
implements the MeSH learning rule. a, In the left panel, we present a 
schematic of the corticostriatal pathway consistent with known anatomical 
data’. Descending cortical outputs, largely from layer V of the neocortex, 
project subcortically and intracortically elaborating axon collaterals onto 
direct (blue) and indirect (red) MSNs in the dorsal striatum. By typical 
convention we assume that dMSNs have a net positive effect on behaviour 
(increase in velocity in this case) and iMSNs have a net inhibitory effect 
(decrease in velocity). These pathways are combined at the basal ganglia 
output nucleus (substantia nigra pars reticulata; not shown) and then 
combined with cortical drive to produce the net movement velocity. The 
model assumes that both dMSNs and iMSNs are positively correlated with 
cortical activity and with movement velocity. We assume a monotonic 
relationship between cortical activity and movement velocity*’. The model 
is initialized at a presumptive steady-state in which weights between 
cortical inputs and dMSN and iMSN units are noisy, but distributed 
around 0.5 and bounded [0,1]. Most simulations were performed with 

100 cortical units and 250 dMSNs and iMSNs each. Under all conditions 
weights are subject to updating according to a balanced plasticity rule 
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(inset) in which inactive units are subject to depression and active units 
are subject to potentiation. All synapses drift back towards a mean of 0.5 
thereby implementing a homeostatic set point to the weight distribution. 
Finally, photostimulation is assumed to enhance (90% increase) the 
magnitude of both depression and potentiation on stimulated trials in the 
stimulated population. Random sets of cortical inputs are assumed to be 
active on any given reach and are drawn from a gamma (shape parameters: 
8, 63) distribution that gives a distribution of velocities similar to that 
observed experimentally. Further details of the model are provided in the 
Methods. b, Example simulations of 100 trials (first 50 receive stimulation 
according to conditions described in legend followed by 50 unstimulated 
recovery trials). Curves reflect averages and standard errors of 100 
repetitions of the simulated condition. Other conventions as in main 
figures. c, Schematic of dendrite of MSN containing synapses active during 
limb movements. Synaptic plasticity enhanced by stimulation (inset in 
panel a) produces a net bias in synaptic weights when delivered in closed- 
loop. This bias can become uniform by permuting the active synapses on 
each simulated trial. 
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Extended Data Figure 7 | D1* and D2* MSNs can follow repetitive evoked spiking rate) and ~500 ms later by a train of 5 pulses of blue light 
stimulus trains of ~20 Hz photostimulation in vitro. Example MSNs of increasing duty cycle. All cells recorded were able to follow rapid phasic 
recorded in vitro in brain slices containing the DMS. Upper row are two stimulation and action potentials were reliably evoked on every stimulus 
example D1* dMSNs recorded from DMS of a Drdla-Cre::Ai32 mouse. of these 20 Hz trains (approximately similar to the stimulus trains used in 
Lower row are two example D2* dMSN recorded from DMS of a Drd2a- stimulation experiments described in the text). Examples were selected 
Cre::Ai32 mouse. 3 example traces shown from each. Spiking is evoked from seven neurons from three D1* animals, and four neurons from three 
by increasing current injection (traces selected for approximately similar D2* animals that were recorded for this particular stimulation design. 
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Extended Data Figure 8 | Characterization of movement onset and identified with the magenta dot. The beginning of the reach (green) was 
reach initiation threshold crossing time. A sample reach from a sham assessed offline for each reach and was determined to be the first time 
block of 50 is shown, with eccentricity in black and velocity in blue (in point, sampled at 1 kHz, constituting the increasing velocity associated 


right panel). The reach start is identified with the greed dot, the threshold with that reach. 
crossing, 9 ms later when the reach eccentricity surpassed 0.1 cm, is 
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Extended Data Table 1 | Lick and move rates from each animal for sham upper third stimulation sessions 


dMSN lick frequency (Hz): 


. 


uency (Hz 
Mouse60 
| 


dMSN inter-move interval (seconds): 


Mouse8 0 (0.3) | 16.3 (0-4) 


Increases in the mean for lick frequency indicate a faster rate of licking (more motor output), while increases in the mean for inter-move interval indicate a slower rate of licking (less motor output). 
Lick rate changes per animal were not significant (P > 0.4 for all but one IMSN animal, which exhibited a paradoxically faster lick rate, P=0.18). Inter-move interval changes were also not significant 
within each animal (P>0.3 except one dMSN animal which paradoxically showed an increased inter-move interval, P=0.15). Mouse number is only meant as an index and does not reflect timing or 
order of experiments, most of which were done in the same period of time. 
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FGF8 and SHH substitute for anterior-—posterior 
tissue interactions to induce limb regeneration 


Eugeniu Nacu!, Elena Gromberg', Catarina R. Oliveira!?, David Drechsel* & Elly M. Tanaka! 


In salamanders, grafting of a left limb blastema onto a right limb 
stump yields regeneration of three limbs, the normal limb and two 
‘supernumerary’ limbs’ *. This experiment and other research have 
shown that the juxtaposition of anterior and posterior limb tissue 
plus innervation are necessary and sufficient to induce complete 
limb regeneration in salamanders’'°. However, the cellular and 
molecular basis of the requirement for anterior—posterior tissue 
interactions were unknown. Here we have clarified the molecular 
basis of the requirement for both anterior and posterior tissue 
during limb regeneration and supernumerary limb formation 
in axolotls (Ambystoma mexicanum). We show that the two 
tissues provide complementary cross-inductive signals that are 
required for limb outgrowth. A blastema composed solely of 
anterior tissue normally regresses rather than forming a limb, 
but activation of hedgehog (HH) signalling was sufficient to drive 
regeneration of an anterior blastema to completion owing to its 
ability to maintain fibroblast growth factor (FGF) expression, 
the key signalling activity responsible for blastema outgrowth. In 
blastemas composed solely of posterior tissue, HH signalling was 
not sufficient to drive regeneration; however, ectopic expression of 
FGF8 together with endogenous HH signalling was sufficient. In 
axolotls, FGF8 is expressed only in the anterior mesenchyme and 
maintenance of its expression depends on sonic hedgehog (SHH) 
signalling from posterior tissue. Together, our findings identify 
key anteriorly and posteriorly localized signals that promote limb 
regeneration and show that these single factors are sufficient to 
drive non-regenerating blastemas to complete regeneration with 
full elaboration of skeletal elements. 

Limb regeneration in salamanders, including axolotls, proceeds 
in four steps: injury, blastema formation, sustained blastema out- 
growth, and differentiation. Previous studies have shown that blastema 
formation is dependent on the presence of nerves at the injury site, 
and that sustained outgrowth requires the presence of both anterior 
and posterior limb cells°-'°. Remarkably, despite the large cadre of 
molecular information that has accumulated on limb patterning in the 
past decades, the cellular and molecular logic underlying the require- 
ment for anterior and posterior cell juxtaposition in maintaining limb 
regeneration is unknown. Two models relying on different cellular 
principles have been proposed to explain this requirement. In the polar 
coordinate model, a graded spectrum of positional values around the 
limb circumference manifests in the form of cell surface properties. 
Disruption of this gradation as a result of transplantation or ampu- 
tation would be recognized by cells at the border, stimulating them 
to proliferate and generate daughter cells that intercalate the missing 
circumferential identities and thereby resulting in distal outgrowth 
of the limb tissue'’. By contrast, the boundary model subdivides the 
limb circumference into four domains: anterior—dorsal, anterior- 
ventral, posterior—dorsal, and posterior—ventral, and proposes that the 
intersection between the domains is required to induce the posterior 
domains to express a secreted morphogen that is required for growth 


and patterning of the limb!”. Thus, complementing, cross-inductive 
interactions between domains rather than local detection of differ- 
ences form the conceptual basis of the boundary model. 

Anterior-only blastemas (ABs) composed solely of anterior tissue 
can be created by deviating nerve endings to an anterior-facing 
lateral wound in the upper limb*”. A mid-bud blastema forms at the 
lateral site but eventually regresses, unless a patch of posterior limb skin 
(including dermal mesenchyme) is grafted to the anterior site; the graft 
allows the blastema to grow and form a fully patterned limb, called an 
accessory limb. We first searched for posterior-associated molecules 
that could stimulate complete limb regeneration from ABs and found 
that stimulation of HH signalling via the global application of smooth- 
ened agonist (SAG) was sufficient to prevent regression of an AB and 
to elicit regeneration of a limb with clear skeletal patterning along the 
proximo-distal axis and the full complement of constituent limb cell 
types (Fig. 1a, Extended Data Figs 1 and 2a, b, d and Supplementary 
Table 1). The accessory limbs displayed varying digit numbers 
that depended on SAG concentration (Extended Data Fig. 2a). 
We corroborated these findings by overexpressing human SHH in 
ABs via baculovirus-mediated delivery; this also led to patterned limb 
outgrowth (n= 8 of 32; Fig. 1b and Supplementary Table 1). These 
results show that activation of the HH pathway is sufficient to induce 
regeneration from an anterior-only blastema. Conversely, inhibition of 
HH signalling by global application of cyclopamine blocked accessory 
limb outgrowth from ABs complemented with a graft of posterior skin 
(Extended Data Fig. 2c), indicating that HH signalling is required 
for ectopic blastema outgrowth when a full complement of positional 
identities is present. 

To gain an insight into the mechanism of SAG-induced outgrowth 
of ABs, we investigated whether HH signalling induced a stable change 
from an anterior to a posterior determination state in parts of the 
ectopic limb tissue, or whether SAG treatment stimulated anteriorly 
determined cells to grow out a limb. Reamputation of SAG-induced 
accessory limbs (SAG-ALs) yielded either no outgrowth (n =2 of 5) 
or outgrowth of a spike (n =3 of 5), suggesting that a stable anterior- 
posterior discontinuity that could induce a secondary limb had not 
been established (Extended Data Fig. 3a). Transplantation is the clean- 
est way to test determination state. Therefore we transplanted skin 
from SAG-ALs to innervated anterior or posterior wounds and looked 
for secondary accessory limb formation (Extended Data Fig. 3b). 
Limb outgrowth occurred when the SAG-AL skin was transplanted to 
posterior (n= 15 of 21) but not anterior wounds (n= 0 of 22), indicating 
that the SAG-AL limb tissue retains an anterior rather than posterior 
determination state (Extended Data Fig. 3c, d and Supplementary 
Table 2). 

These results suggested that SAG may promote accessory limb 
outgrowth by upregulating a downstream growth-stimulating signal- 
ling loop that is involved in limb development; namely, posteriorly 
secreted SHH that stimulates upregulation of gremlin1 (grem1) and 
consequently fgf genes, whose protein products in turn maintain shh 


IDFG Research Center for Regenerative Therapies, Technische Universitat Dresden, 01307 Dresden, Germany. @Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, 
Germany. *Graduate Program in Areas of Basic and Applied Biology, Abel Salazar Biomedical Sciences Institute, University of Porto, 4099-003 Porto, Portugal. 
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Figure 1 | Activation of HH signalling or expression of FGF8 in anterior 
blastemas (ABs) is sufficient to drive outgrowth of accessory limbs. 

a, Accessory limbs develop from ABs treated with 10 nM SAG (n= 43 of 66), 
but not from ABs treated with water (n = 0 of 22). Alcian blue/alizarin 

red staining reveals the skeletal pattern with supernumerary digits and 
carpal elements in the accessory limb. Data from three experiments. See 
Supplementary Table 1 for details. b, Ectopic expression of SHH in ABs by 
baculovirus transduction is sufficient to drive accessory limb outgrowth 
(n=8 of 32, data from two experiments). Black dashed lines demarcate 
the outgrowing accessory limb. See Supplementary Table 1 for details. 

c, Ectopic expression of FGF8 in ABs by baculovirus transduction is 
sufficient to drive accessory limb outgrowth (n = 14 of 57, data from four 
experiments). Alcian blue/alizarin red staining reveals the skeletal pattern 
with a single digit in the accessory limb. See Supplementary Table 4 for 
details. d, Expression of FGF-SHH loop components in ABs untreated 

or treated with 10nM SAG and analysed with Nanostring nCounter 
technology. Each data point is the mean normalized count number from 
biological replicates (n = 3 or 4; for individual n values and complete 

data set, see Supplementary Table 7a; data from one experiment). Error 
bars represent s.d. For data points where the s.d. was low, no error bars 

are shown. The dashed line is for visual aid and indicates the level of 
mature (day 0) expression. For in situ hybridization of selected genes, 

see Extended Data Fig. 4. Scale bars, 2mm. dpw, days post-wounding. 

For alcian blue/alizarin red staining, cartilage is blue and ossifications 

are red. 


expression in posterior cells'*-*°. We therefore quantified the expres- 
sion of key limb development factors and limb blastema genes in 
SAG-treated versus untreated ABs by Nanostring nCounter analysis 
(a method for counting RNA transcripts) and by RNA in situ hybrid- 
ization. As expected, we observed upregulation of the blastema- 
associated transcription factors prrx1, msx1, msx2 and twist1 and of 
the anterior marker gli3 in ABs treated with SAG or water (Extended 
Data Fig. 4a, b and Supplementary Table 7a). Interestingly, a set of 
extracellular signalling factors that included fef8, fgf9, fgf10 and 
grem1 were sustainably and highly upregulated in SAG-treated ABs 
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Figure 2 ‘ FGFS is not detected in posterior blastemas (PBs) and its 
ectopic expression is sufficient to drive accessory limb outgrowth in 
the presence of intact endogenous HH signalling. a, Accessory limbs 
did not grow from PBs (upper row) of animals treated with 10 nM SAG 
(n= 1 of 32), while ABs on the contralateral forelimbs of the same animals 
(lower row) developed into accessory limbs (nm = 23 of 32), indicating 
efficient deactivation of HH signalling. Data from one experiment. 
Yellow dashed line demarcates the AB. See Extended Data Fig. 6a-c, f for 
controls and Supplementary Table 1 for details. b, PBs expressing FGF8 
after baculovirus transduction grow into accessory limbs (n = 27 of 50; 
data from three experiments). See Extended Data Fig. 6d-e, g for controls 
and Supplementary Table 5 for details. c, PBs expressing ectopic FGF8 
and treated with 8\.M cyclopamine, an inhibitor of HH signalling, do not 
develop into accessory limbs (n = 3 of 48; data from four experiments). 
See Extended Data Fig. 10 for controls and Supplementary Table 6 for 
details. d, Expression of FGF-SHH loop components in PBs treated with 
10 nM SAG, untreated or complemented with anterior skin, analysed 
with Nanostring nCounter technology. Note the lack of fgf8 induction 

by SAG despite the induction of the HH signalling targets glil and ptcl, 
which indicates functional HH signalling. Each data point is the mean 
normalized count from biological replicates (n = 2 or 3; for individual n 
values and complete data set, see Supplementary Table 7b; data from one 
experiment). Error bars represent s.d. For data points where the s.d. was 
low or where n= 2, no error bars are shown. The dashed lines are for visual 
aid and indicate the level of mature (day 0) expression. Extended Data 
Fig. 7 shows in situ hybridization of selected genes. Scale bars, 2mm. dpw, 
days post-wounding. 


but showed only an initial, unsustained upregulation in untreated 
ABs. Fgf17 was expressed only in ABs treated with SAG, and shh was 
not expressed in ABs at all (Fig. 1d, Extended Data Figs 4 and 8 and 
Supplementary Tables 3, 7a and 8). 
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Figure 3 | HH signalling is required for maintaining blastema 
outgrowth and regeneration. a, Inhibition of HH signalling with 
cyclopamine blocks progression of limb regeneration. Limbs treated 

with cyclopamine (8 1M) starting from day 6 after amputation do not 
regenerate (n = 0 of 14), whereas ethanol-treated controls do (n= 6 of 6). 
Data from three experiments. Scale bars, 2 mm. b, Nanostring nCounter 
gene expression analysis of limbs treated with 8 11M cyclopamine or 
ethanol. The schema illustrates the experimental setup. The blastema 
gene prrx1 shows a similar profile in control and cyclopamine-treated 
samples, whereas genes involved in the SHH-FGEF loop and the late genes 
fgf17 and hox13 diverge under the two conditions. Each data point is the 
mean normalized count of data from biological replicates (n = 3 or 4; for 
complete data set and individual n values, see Supplementary Table 10; 
data from one experiment). Error bars represent s.d. For data points where 
the s.d. was low, no error bars are shown. 


These data pointed to FGF signalling as a potentially critical effector 
of SAG in ABs. To test this theory, SAG-treated ABs were exposed to 
the FGFR1/FGFR3 signalling inhibitor PD173074 starting at day 10 
after wounding. Treatment of ABs with SAG alone induced accessory 
limb formation (n= 14 of 14), whereas addition of PD173074 blocked 
SAG-induced accessory limb outgrowth (n= 0 of 16) (Extended Data 
Fig. 5a). To test whether FGF signalling can drive accessory limb 
outgrowth, FGF8 was ectopically expressed in ABs by baculovirus 
transduction; this yielded single digit outgrowths (Fig. 1c, Extended 
Data Fig. 5b and Supplementary Table 4) without radius or ulna 
formation or anterior—posterior digit elaboration. These data show 
that FGF signalling is both necessary and sufficient to drive accessory 
limb outgrowth from ABs. 

We next tested whether SAG treatment was sufficient to induce 
outgrowth of posterior-only blastemas (PBs). PBs also regress unless 
complemented with an anterior skin graft. Interestingly, we found that 
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Figure 4 | Molecular circuitry of signals involved in anterior and 
posterior tissue requirement of regeneration. Anterior cells of the 
blastema express fgf8, whereas shh is expressed in the most posterior 
part. The initial upregulation of fgf8 in anterior cells is independent of 
interaction with posterior cells and the initial upregulation of shh in 
posterior cells is independent of interaction with anterior cells. Later, the 
expression of fgf8 and shh becomes interdependent in a positive feedback 
loop, probably involving GREM1, that is expressed primarily in the 
anterior half of the blastema but also in some posterior hand2-expressing 
cells. FGFs are the factors that can drive blastema outgrowth and limb 
regeneration. 


SAG did not induce PBs to form accessory limbs (n = 1 of 48) (Fig. 2a, 
Extended Data Fig. 6a—c, f and Supplementary Table 1). Nanostring 
analysis (Fig. 2d and Supplementary Tables 7b, 9) and RNA in situ 
hybridization (Extended Data Fig. 7 and Supplementary Table 3) 
for glil, ptcl and shh revealed that these factors were expressed in 
PBs with or without SAG, indicating that the HH signalling system 
is intact in PBs but is not sufficient to induce regeneration in this 
context. In addition, grem1 and fgf10 were also initially upregulated 
in PBs with or without SAG, and fgf9 was initially upregulated in 
untreated PBs. On the other hand, fgf8 and fgf17 were not upregulated 
in PBs with or without SAG (Fig. 2d, Extended Data Figs 7 and 8 and 
Supplementary Tables 3, 7b, 9). RNA in situ hybridization on normal 
blastemas revealed that the fgf8 signal localized to the anterior half 
of the blastema, being excluded from the posterior hand2-expressing 
zone. Interestingly grem1, fgf9, fgf10 and fgf17 were not excluded from 
the posterior hand2-expressing domain (Extended Data Fig. 9a, b). 
These results suggested that FGF8 is the anteriorly expressed FGF that 
is limiting in PB outgrowth. To test whether FGF8 expression could 
substitute for anterior cells in PBs to drive accessory limb outgrowth, 
we transduced the posterior regions of axolotl limbs with baculovirus 
expressing fgf8 and created PBs several days later. FGF8 expression 
alone was sufficient to induce accessory limb outgrowth in 54% of 
cases (n = 27 of 50), whereas no significant outgrowth was observed 
after control mCherry expression (Fig. 2b, Extended Data Fig. 6d, e, g 
and Supplementary Table 5). Interestingly, FGF8-mediated outgrowth 
from PBs required endogenous HH signalling, as co-treatment with 
cyclopamine blocked accessory limb outgrowth (Fig. 2c, Extended 
Data Fig. 10 and Supplementary Table 6). Our results show that FGF8 
expression together with endogenous HH signalling is sufficient to 
drive accessory limb outgrowth from PBs. 

We next investigated the relevance of our findings to normal regen- 
eration. Treatment of upper arm blastemas with cyclopamine starting 
at day 6 after amputation blocked regeneration (Fig. 3a) and yielded 
changes in the expression of extracellular signalling factors (Fig. 3b 
and Supplementary Table 10). The limb blastema marker prrx1 showed 
similar profiles in blastemas treated with cyclopamine or ethanol 
(control) up to day 16. In contrast, fef8, fef9, and fgf10 showed sus- 
tained expression under control conditions, whereas in cyclopa- 
mine-treated blastemas, the initial induction of fef8, fgf9, and fgf10 
up to day 8 was followed by downregulation. Interestingly, grem1 
showed a faster kinetic, peaking after 6 days before being downregu- 
lated in cyclopamine-treated blastemas, and shh expression started to 
be downregulated at day 10 in cyclopamine-treated blastemas. These 
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profiles are consistent with GREM1 being upstream of fgf gene expres- 
sion, and with FGFs being required for shh maintenance'*'?***. Two 
late-stage genes associated with distal limb bud development, fgf17 
and hoxa13, were strongly expressed only in ethanol-treated blaste- 
mas, suggesting that FGF-SHH signalling is required for substantial 
distalization during limb regeneration. These results show that the 
FGF-SHH loop is active during regeneration and is required for blas- 
tema outgrowth. Additionally, the anterior localization of fgf8, but not 
of fef9, fef10 or fgf17, in the limb blastema, and the lack of expression 
of fgf8 in PB cells, suggest that the cross-induction between anteriorly 
restricted fgf8 and posteriorly restricted shh (Extended Data Fig. 9a, c) 
is an essential part of normal limb regeneration (Fig. 4). 

In summary, we have delineated the mechanism underlying the 
anterior—posterior juxtaposition that has long been known to be 
required for limb regeneration. We have identified single factors that 
can substitute for anterior and posterior tissue functions: SHH for pos- 
terior and FGF8 for anterior tissue. This work shows that, in axolotls, 
the anterior and posterior tissue provide cells that are restricted in 
their competence to express complementary soluble signalling factors 
required for the growth and patterning of a mid-bud blastema, a result 
consistent with the boundary model of limb regeneration’. Posteriorly 
localized SHH is required for the maintenance of anteriorly restricted 
FGF8 expression and of non-restricted expression of FGF9, FGF10 
and FGF17. Sustained FGF signalling is the key limiting factor for 
extended proliferation of anterior and posterior cells and subsequent 
morphogenesis (Fig. 4). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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Data reporting. Animal distribution to experimental groups was randomized 
within the selection criteria outlined below for different experiments. The investi- 
gators were not blinded to allocation during experiments and outcome assessment. 
Animal procedures. Axolotls (Ambystoma mexicanum) used in all experiments 
were bred in captivity in our facility, maintained at 17-24°C in tap water and fed 
with artemia. Axolotls of the leucistic (d/d) strain were used in all experiments. 
Non-transgenic animals were used in all experiments unless indicated otherwise. 
When necessary, transgenic animals of the caggs::egfp genotype were used”®. 

Animals of length between 2 and 4cm from snout to cloaca (that is, between 4 
and 8 cm from snout to tail tip) were used in all experiments, except for the SAG 
treatment experiment in which larger 13-14-cm (snout to tail tip) animals were 
used (Extended Data Fig. 2b). In any given experiment, the range in the length 
(snout to cloaca) of animals used was no more than 0.4cm, to ensure comparable 
dynamics of regeneration. Among different replicate experiments, animals of sim- 
ilar size were used. For any given experiment, sibling animals were preferentially 
used. In cases when this was not possible, animals from different matings were 
evenly distributed among the different experimental subgroups. It was not possi- 
ble to determine the sex of the animals at the stage at which the experiment was 
performed and therefore this was not taken into account. 

All animal procedures were carried out in accordance with the laws and regu- 
lations of the State of Saxony, Germany. 

Surgical procedures. For all experimental procedures, axolotls were anaesthetized 
in 0.03% benzocaine (Sigma) before surgery. All surgeries were performed using 
Olympus SZX10 microscopes. 

To initiate normal limb regeneration, limbs were amputated either through the 
upper arm or lower arm. 

To generate ABs and PBs, wounds were created by cutting out a rectangular 
piece of skin from the anterior or posterior side of the distal half of the upper arm. 
Two ventral nerves were dissected free, severed at the elbow level and deflected 
to the wound. 

Skin transplantation to wounds was performed similarly. A slightly larger patch 
of skin compared to AB or PB production was removed from the host distal half 
of the upper arm and nerves were deflected to the wound. A square skin flap from 
the donor arm was placed on the distal half of the host wound, while leaving the 
proximal half uncovered with nerves sticking out. To allow healing of the trans- 
plant, animals were left for 15 min to 1h in a Petri dish humidified with 0.03% 
benzocaine; afterwards animals were transferred into tap water. 

Treatment of animals with chemical agonists or inhibitors. In all experiments 
that required animals to be treated with chemical compounds, the compounds 
were dissolved in tap water and animals were kept in that solution. Solutions were 
made freshly and exchanged daily. The chemical compounds used in the study 
were: smoothened agonist SAG (EMD Millipore), PD173074 (Tocris Bioscience), 
cyclopamine (LC labs), dimethyl sulfoxide DMSO (Sigma) and ethanol (99.9% 
VWKR chemicals). Chemical solutions were prepared in glass bottles covered with 
aluminium foil that protected the solutions from light, and stirred for a minimum 
of 30 min with a magnetic stirrer. Solutions were distributed to the boxes in which 
animals were kept. The animals were then transferred into the boxes with solutions 
and kept in darkness. During treatment, animals were fed daily for 30 min to 1h 
under lab light or preferentially in darkness. Feeding was started 30 min to 1h 
before the chemical solution was exchanged. 

SAG treatment of ABs and PBs. ABs and PBs were generated by surgical methods 
as described above. Treatment with SAG was performed as described above and 
started immediately after surgery or 3 days after surgery for some PB samples. 
The duration of treatment with SAG varied; the duration of treatment for each 
experiment is indicated in Supplementary Table 1. 

Cyclopamine treatment of blastemas, ABs complemented with posterior skin, 
and PBs complemented with anterior skin. Limbs were amputated or injured as 
above. An 81M cyclopamine solution in tap water was prepared from a stock of 
4mM cyclopamine in ethanol as described. We found that the effect of cyclopa- 
mine was dependent not only on the final concentration but also on the volume of 
the solution in which animals were kept. In all experiments, the volume of solution 
per animal was 60 ml. A control solution of ethanol in tap water with similar dilu- 
tion and volume was also prepared. Treatment with solutions was started 6 days 
after surgery. The duration of treatment is indicated in the experimental schemas 
associated with each experiment. 

Exogenous expression of genes in ABs or PBs by baculovirus transduction. 
Baculovirus technology”’~*? was established in our lab for use as a transient expres- 
sion system in the axolotl and will be described elsewhere. Baculovirus was pseu- 
dotyped with vsv-ged gene, which was inserted into the rescue vector under the 
baculovirus polyhedrin promoter. Genes of interest (human SHH, axolotl fgf8, and 
mCherry) were cloned into the baculovirus rescue vector under the CMV promoter 
using standard restriction enzyme methods. 


LETTER 


The generation of pseudotyped baculoviruses carrying the gene of interest 
was carried out by co-transfection of the rescue vector together with 
replication-incompetent baculovirus DNA into a modified Spodoptera frugiperda 
cell line (expresSF+, ProteinSciences Corporation). Upon culture expansion, 
recombinant baculoviruses were collected, concentrated and purified. Baculovirus 
titre was assessed by end-point dilution assay in SF-9 Easy Titer cells, a modified 
Spodoptera frugiperda cell line expressing a reporter egfp gene under the control 
of the baculovirus polyhedrin promoter*”. 

Expression of SHH in ABs by baculovirus transduction was achieved by inject- 
ing baculovirus—SHH into the anterior sides of mature upper arms, followed by 
surgery 5 days after injection. Titres of injected baculovirus suspension are shown 
in Supplementary Table 1. 

Expression of mCherry or FGF8 was achieved by injecting the corresponding 
baculovirus into the posterior or anterior sides of mature upper arms. Titres of the 
viral suspensions injected were: mCherry, 2.7 x 10'° or 9.4 x 10!°; FGF8, 2.7 x 101° 
or 8.5 x 102°, 

Injections were performed using a glass capillary needle pulled on a micropi- 
pette puller (Sutter Instrument Co. model P-97). Glass capillaries (1.2 mm OD, 
0.9mm ID borosilicate glass) were acquired from Harvard Apparatus. For injection 
into the anterior upper arm, either a tiny incision was made with surgical scissors 
(FineScienceTools No 15024-10) or the skin was punctured with a 30G needle 
(BD Bioscience) on the dorsal side of the upper arm just proximal to the elbow. 
The needle was inserted through the incision/puncture and the virus was injected 
into the biceps and surrounding connective tissue. For injection into the posterior 
upper arm, an incision/puncture was made posterior—dorsally just above the elbow, 
the needle was inserted through the incision/puncture and the virus was injected 
into the triceps muscles and surrounding connective tissue. 

Statistical analysis of outgrowths. First, outgrowths were sorted into several cat- 
egories according to the extent of outgrowth (see Supplementary Tables 4 and 6). 
Next, different categories were grouped into two supra-categories: substantial out- 
growth versus non-substantial outgrowth, with outgrowths that have at least two 
cartilaginous segments being the cut-off for what marks a substantial outgrowth. 
We then used the Cochran-Mantel-Haenszel test with continuity correction for 
repeated tests of independence on categorical data to assess whether different 
treatments had statistically significant effects on the likelihood of substantial out- 
growths. The two groups of category were: type of manipulation (for example, ABs 
+ FGF8 versus ABs + mCherry) and type of outgrowth (for example, substantial 
outgrowth versus non-substantial outgrowth). 

Choice of sample size in experiments. When we used SAG to investigate the 
role of HH signalling in accessory limb outgrowth from ABs, the sample size was 
chosen at random for the first experiment. For subsequent experiments it was 
chosen based on the rate of accessory limb induction in the first experiment. In 
experiments involving baculoviral delivery of human SHH or axolotl fgf8 to ABs, 
the sample size was also chosen based on the rate of accessory limb induction 
from ABs by SAG. 

In experiments testing the potential of SAG to induce accessory limbs from PBs, 
we used four conditions: PBs treated with SAG, PBs treated with water (negative 
controls), PBs complemented with anterior skin (positive controls), PBs comple- 
mented with posterior skin (negative controls). The sample size for all groups 
was based on the rate of accessory limb induction from ABs by SAG, as described 
above. 

In experiments involving baculoviral delivery of mCherry or fgf8 to PBs, the 
sample size was chosen based on the induction rate of accessory limbs from PBs 
complemented with anterior skin, taking into consideration the false positive rate 
of accessory limb induction from PBs treated with water, as identified in the PB 
experiments described above. 

In experiments where the determination state of SAG-induced accessory limbs 
from ABs was assayed by transplanting accessory limb skin onto host anterior or 
posterior wounds, the sample size was based on preliminary experiments and 
resulting information about the induction rate of accessory limb outgrowth from 
PBs complemented with anterior skin or ABs complemented with posterior skin. 
Nanostring gene expression analysis. For probe design, sequences of genes of 
interest were taken from the axolotl transcriptome that has been assembled from 
Illumina sequencing reads (to be published elsewhere). For genes whose sequence 
was validated by the Sanger method, the Nanostring probes were designed to the 
whole validated sequence. For non-validated genes, the Nanostring probes were 
designed only to the open reading frame of the sequence. Probes were designed 
by Nanostring Technologies. 

RNA isolation for Nanostring analysis was done by a tandem purification using 
Trizol (Ambion) and Qiagen Micro Kit (Qiagen). The tissue was homogenized 
in Trizol using a polytron homogenizer. After chloroform addition and centrif- 
ugation, the aqueous phase was collected. An equal volume of 70% ethanol was 
added to the collected aqueous phase and vortexed. The solution was then loaded 
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onto a Qiagen Micro Kit column, and RNA purification was done following the 
manufacturer’s protocol. 

For Nanostring hybridization, each hybridized sample corresponded to one 
biological replicate. Each biological replicate consisted of a single blastema, AB 
or PB. In each hybridization we used 200 ng of RNA (analysis of blastemas) or 
300 ng of RNA (analysis of ABs and PBs). Samples were hybridized according to 
the manufacturer's instructions, and subsequently analysed on the Nanostring 
Digital Analyzer to obtain the data on the number of counts. Data were normalized 
based on the counts of housekeeping genes rp/4 and ef1 with Nanostring nSolver 
software. Statistical analysis (one-way ANOVA with false discovery rate correc- 
tion, followed by Tukey post-hoc test) was performed in Genespring GX (Agilent) 
software. Graphs showing expression profiles of genes were generated with Prism6 
(Graphpad) software. 

Alcian blue/alizarin red staining. The regenerates were fixed overnight in 
MEMFA (1 x MEM salts, 3.7% formaldehyde), followed by three 5-min washes 
in PBS. The limbs were dehydrated in 25% and 50% ethanol (EtOH) for 10 min 
each. They were then placed in alcian blue solution and kept at 37 °C for approxi- 
mately 1h or until all cartilage elements were visibly stained. The limbs were then 
washed in EtOH-acetic acid mix for 1 h, 95% EtOH for 15 min and twice in 1% 
KOH solution for 10 min each time, until the soft tissue became clear. They were 
then placed in alizarin red solution, and stained at room temperature until the 
ossified parts of the bones became visibly stained (usually approximately 25 min). 
They were subsequently washed in 1% KOH-glycerol mix. Finally, the limbs 
were placed in 20% glycerol-PBS solution and imaged using an Olympus SZX16 
microscope and Cell F software (Olympus). 

Cryosectioning. Limbs with accessory limbs were collected and fixed in MEMFA 
overnight at 4°C. Afterwards, the limbs were washed for 2 x 5min and 4 x 10 min 
in 1x PBS and placed in 20% sucrose solution overnight at 4°C. The next day they 
were embedded in optimum cutting temperature (OCT) compound and frozen in 
plastic blocks. Samples were sectioned into 10-j1m-thick slices that were collected 
on Superfrost-plus microscopy slides (Menzel-Glaser). The sections were allowed 
to air dry for 1h before being stored at —20°C. 

Immunostaining. The following antibodies were used for staining: anti-Meis1/2/3, 
clone 9.2.7 (EMD Millipore); anti-beta-3-tubulin (R&D systems MAB1195); 
anti-myosin heavy chain (generated in-house at MPI-CBG antibody facility). 
Staining was performed using protocols previously described)”. 

Paraffin embedding and sectioning. Limbs with accessory limbs or blastemas were 
collected and fixed in MEMFA for 3 days at 4°C. Afterwards, the limbs were washed 
for 2 x 5 min in 1 x PBS treated with diethylpyrocarbonate (DEPC) and sequen- 
tially dehydrated in 25%, 50%, 75% and 100% ethanol in DEPC-treated water. The 
samples were left overnight in 100% ethanol. The next day, the limbs were washed 
for 10 min in fresh 100% ethanol, then for 3 x 40 min tol h in 100% xylol at room 
temperature, and lastly for 3 x 1h in melted paraffin at 70°C. The limbs were placed 
on the bottom of disposable plastic base moulds filled with molten paraffin and left 
on a smooth surface to solidify at room temperature. Samples were cut into sections 
with a thickness of 7-10,1m and transferred onto superfrost-plus microscopy slides 
(Menzel-Glaser) covered with DEPC-treated water. The slides were placed on a 
heating platform at 37°C and allowed to air dry for at least 1h. 

Isolation of genes for in situ hybridization. We used 10-12 freshly frozen 9-day 
or 15-day blastemas for total RNA extraction using a standard trizol-based 
protocol. A SuperScript III kit (Invitrogen) was used for cDNA synthesis according 
the manufacturer's protocol. cDNA was used for PCR with appropriate primers to 
isolate the genes of interest. PCR products were run in 2% agarose gel and bands of 
the expected size were cut out and purified from the gel. PCR products were cloned 
into a PCR-II Topo vector, using TOPO TA Cloning kit (Invitrogen). Accession 
numbers are listed after the Acknowledgements section. 

For grem1 in situ hybridization we pulled out a sequence of 937 bp by PCR using 
the following primers: forward TCGCCTGACACTACATAGC, reverse AGAT 
GTGCAAAAGTTCAGAGAT. For hand2 in situ hybridization we pulled out 
a sequence of 632 bp by PCR using the following primers based on the newt 
HAND2 sequence: forward TGAGCCTGGTGGGGGGCTTCC, reverse 
AGCGCCCAGACGTGCTGCGGCCA. For gli3 in situ hybridization we iso- 
lated a sequence of 933 bp by PCR using the following primers: forward ATCAT 
TAATAAAGAAGGAGATCCA, reverse ATAGGTCTCTGGGTAGGAAAAG. 
For hoxd13 in situ hybridization we isolated a sequence of 735 bp by PCR using 
the following primers: forward GGCCGAGGTTAGTTTTTATAACCGCA, 
reverse AAAGGAAGACTTCCAGAAGTGCGAGCTCT. For fgf8 in situ hybrid- 
ization we isolated a sequence of 1,148 bp by PCR using the following primers: 


forward TTCACGTGCCTCACCTCCACTACCTCAGCA, reverse ATATAAAT 
TGTTCTTCTAAAAGTCCACTGG. For fgf9 in situ hybridization we isolated 
a sequence of 972 bp by PCR using the following primers: forward ATTAAACCCG 
CTTTGATTTCTTIG, reverse GTAGAGATTAGATG AGTATGTGTTA. Fef10 
and fgfl7 sequences were isolated by screening an EST library of 6-day axolotl 
tail blastema cDNA. For ptcl and shh in situ hybridization, we used sequences 
previously described*?. 

In situ hybridization probe synthesis. The probes were synthesized using lin- 
earized plasmids containing genes of interest, SP6 or T7 polymerase (produced 
in-house), and DIG-labelled nucleotide mixture (Roche). Synthesis was performed 
for 2-5h at 37°C. RNA was purified from the RNA synthesis mix using RNeasy 
MinElute Cleanup Kit (Qiagen) and a small amount of purified RNA was run ona 
1% agarose gel to assess its quality. Concentration was measured on NanoDrop and 
adjusted to 100 ng! with hybridization buffer (50% formamide, 10% dextran, 
5x SSC, 0.1% Tween, 1 mgml! yeast RNA, 100mg ml“ heparin, 1 x Denhardt's 
solution, 0.1% CHAPS detergent and 5mM EDTA). 

In situ hybridization. Slides were deparaffinized by washing 3 times in 100% xylol 
and 3 times in 100% ethanol for 10 min each step, dried for 15 min and placed 
overnight at 70°C in hybridization buffer containing probes at the following con- 
centrations: fgf8 500 ngyl~’, fef9 300 ng ul, fef10 300 ng, fef17 300 ng yl’, 
gli3 500 ng ul’, grem1 500ngyl~*, hand2 200 ngyl', shh 200 ng’. On the next 
day, slides were washed in salt solutions at 70°C as follows: 30 min in 5 x SSC, 
40 min in 2 x SSC twice, 20 min in 0.2 x SSC. To reduce background signal com- 
ing from nonspecific probe binding, slides were washed in Tris-sodium-EDTA 
(TNE) solution for 2 x 10 min, treated with 20\1gml1~' RNase A (Sigma) in TNE 
for 15 min, and washed in TNE twice, for 2 min and 10 min. After RNase treatment, 
slides were washed for 2 x 5 min and 1 x 10 min in maleic acid buffer (MAB) and 
blocked for 1h with blocking buffer (1% blocking reagent (Roche) dissolved in 
MAB). Then slides were incubated with anti-DIG-AP antibody (Roche) diluted 
1:5,000 in blocking buffer for 1-2 h at room temperature or overnight at 4°C. 
Slides were washed at room temperature in MAB for 2 min, 5 min and 3 x 10 min, 
followed by 2 x 10 min washes in AP buffer. Slides were developed in BM Purple 
solution (Roche) or preferentially a substrate working solution containing NBT/ 
BCIP (Roche) and 5% polyvinyl] alcohol (Sigma) at 37°C in the dark until the 
signal became clearly visible. The reaction was stopped by washing in 1 x PBS 
several times. 

In the case of the fgf9 in situ hybridization signal in untreated 12-day ABs 

(Extended Data Fig. 4f), the signal was localized to the base of the blastema but 
was not considerably stronger than the background signal that is typically seen in 
muscle and nerve for limb in situ hybridization samples. Combined with observed 
reduction in counts of fgf9 after 8 days in Nanostring data (Fig. 1d), we cannot 
conclusively say whether this basal fgf9 signal represents bona fide expression or 
background signal. 
Imaging and figure assembly. Images of antibody-stained samples were acquired 
ona Zeiss Axioobserver Z.1 using AxioVision 4.8.1.0 software (Zeiss). Images of 
in situ-stained samples were acquired on an Olympus BX61VS110 system using 
Olympus VS-ASW software. 

Images were processed with Fiji or Photoshop software. When adjusted, contrast 
and brightness were adjusted uniformly across the image. 
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Extended Data Figure 1 | SAG-induced accessory limbs contain Expression of MEIS is seen in the host upper arm on which the accessory 
muscles, axons, and primarily lower arm and hand elements. limb was induced. MEIS staining is also observed in the epidermis, 

a, Immunostaining for myosin heavy chain (MHC) reveals muscle fibresin | however epidermis does not play a role in positional identity. d, Overlay of 
the accessory limb (AL). b, Immunostaining for 83-tubulin reveals axons anti- MHC (green), anti-33-tubulin (white), anti- MEIS (red) and Hoechst 
in the accessory limb. c, Immunostaining for MEIS, an upper arm identity (marks nuclei, blue) staining. Number of accessory limbs analysed, n= 5. 
marker, reveals that the accessory limb contains primarily lower arm and Scale bar, 1 mm. 

hand elements, as evidenced by the absence of MEIS in the accessory limb. 
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Extended Data Figure 2 | HH signalling is sufficient for accessory 

limb outgrowth from ABs and is necessary for outgrowth from ABs 
complemented with posterior skin. a, Treatment of ABs with lower 
concentrations of SAG still results in accessory limb outgrowth, but 

with a decreased number of digits. Data from a single experiment. 

b, SAG (40 nM) also induces accessory limb outgrowth from ABs in large 
animals (13-14cm snout to tail tip), indicating that this effect is not 
size-dependent. Data from a single experiment. c, Blocking HH signalling 


@ start treatment 


20d 


@ image 


with cyclopamine (8 |1M) in ABs complemented with posterior skin blocks 
regeneration. Data from a single experiment. d, Percentage of accessory 
limb outgrowths from ABs treated with different concentrations of SAG. 
See Supplementary Table 1 for details. Short blue horizontal lines represent 
average percentages for each condition. The dotted line is for visual 

aid and intersects the y axis at 0%. Numbers in upper right corners of 
photomicrographs indicate the number of accessory limb outgrowths out 
of the total number of ABs. dpw, days post-wounding. Scale bars, 2mm. 
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Extended Data Figure 3 | SAG treatment does not make the 
determination state of accessory limbs more posterior. a, SAG-induced 
accessory limbs (SAG-AL) regenerate at most a spike after amputation, 
indicating a lack of functional positional discontinuity. SAG-ALs were 
amputated and imaged at 14 and 21 days post-amputation. No further 
growth of the spike was observed 21 days post-amputation. Red or 

black lines, amputation plane. No SAG-ALs regenerated a patterned 
accessory limb after amputation (n= 0 of 5). Experimental data from 

a single experiment. Alcian blue/alizarin red staining: cartilage is blue, 
ossifications are red. b-d, Testing determination state of SAG-ALs by skin 
transplantation to ABs or PBs. b, Schema illustrating the experimental 
setup and potential outcomes. Accessory limbs were generated by SAG 
treatment of ABs from transgenic animals constitutively expressing GFP. 


one to posterior host site 


CONCLUSION 
+ > Accessory limb skin 


has anterior identity 


DW 


possible 
— SP outcomes 


> Accessory limb skin 
has posterior identity 


The SAG-ALs were divided into a medial and lateral side, with skin 

from one side transplanted to an anterior host site and skin from the 
opposite side transplanted to a posterior host site. Two possible outcomes 
were expected, depending on whether SAG posteriorized cells during 
initial accessory limb outgrowth. c, A host AB with a skin transplant 

from a SAG-AL shows no accessory limb outgrowth, indicating no 
determination of posterior identity during SAG-AL outgrowth (n=0 of 22). 
d, A host PB with a skin transplant from the same SAG-AL as in ¢ results 
in new accessory limb outgrowth, revealing the anterior identity of the 
transplanted skin (n= 15 of 21). Data from a single experiment. See 
Supplementary Table 2 for details. DF, dark field images. GFP panels show 
the transplanted skin. dpt, days post transplantation of skin. Scale bars, 2 mm. 
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Extended Data Figure 4 | Gene expression analysis of ABs with and 
without SAG. a, ABs express blastema markers prrx1, msx1, msx2, and twist1, 
and anterior marker gli3. The y axis indicates the number of normalized 
counts obtained by Nanostring nCounter technology. Each data point is the 
mean normalized count number from biological replicates (n =3 or 4; 

for individual n values and complete data set, see Supplementary Table 7a; 
data from one experiment). Error bars represent s.d. For data points where 
s.d. was small, no error bars are shown. b, gli3 is expressed at 8 and 12 

days in SAG-treated or untreated ABs. c-h, Expression of FGF-SHH loop 
components in SAG-treated versus untreated ABs. c, d, f, In ABs, fgf8, fef9, 
and grem1 maintenance, but not induction, requires SAG treatment. 

c, d, fgf8 and grem1 are upregulated in ABs in the presence or absence of SAG 
at 8 days, but are downregulated at 12 days unless SAG is present. f, fgf9 is 
expressed at the tip of the bumps at 8 days and maintains this expression at 
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a000 tk SAG [10nM] 
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days post-wounding 


SAG 


12 days 


12 days 


12 days 8 days 12 days 


the tip in 12-day SAG-treated samples. In untreated 12-day samples there is 
no more fgf9 expression at the tip, although there seems to be expression at 
the base of the bump. This could represent a secondary source of fgf9 that 

is not part of the FGF-SHH circuitry or it could be an unspecific signal; see 
Methods. g, fgf10 was expressed in all conditions. h, Unlike other fgf genes, 
fgfl7 expression required SAG at all time points. fgfl7 was not expressed in 
untreated samples at either 8 or 12 days, but was expressed at both time points 
when treated with SAG. e, SAG is not sufficient to upregulate shh in ABs. 

Shh is not expressed in ABs at 8 or 12 days in the presence or absence of SAG. 
For detailed information on the number of samples analysed and results, see 
Supplementary Table 3. Inserts show magnification of positive cells. Images 
of 12-day SAG-treated blastemas in panels f-h show different sections from 
the same limb that were hybridized separately for each of the indicated genes. 
Scale bars: all 8-day images, 100 1m; all 12-day images, 200 1m. 
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Extended Data Figure 5 | FGF signalling is required for SAG-induced 
accessory limb outgrowth from ABs, and is sufficient to induce single- 
digit accessory limb outgrowth from ABs. a, FGF signalling is required 
for SAG-induced accessory limb outgrowth. ABs were treated with 15 nM 
SAG or water starting on the day of wounding. At 10 days post-wounding, 
concomitant treatment with DMSO or 151M PD173074 (inhibitor of 
FGFRI1 and FGFR3) was initiated. Treatment was terminated at 22 days 
post-wounding. Images of ABs were taken at 10, 15 and 30 days post- 
wounding. ABs treated with water and DMSO fail to develop accessory 
limbs (n= 1 of 16). ABs treated with SAG and DMSO form accessory 
limbs (n= 14 of 14). ABs treated with SAG and PD173074 grow until 


22d 30d 
stop image 
treatments 


outgrowths 
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14 out of 14 


0 out of 16 


experiment 1 
experiment 2 
experiment 3 


experiment 4 


initiation of PD173074 treatment, after which growth halts and ABs fail to 
develop accessory limbs (n= 0 of 16), eventually regressing. This indicates 
that FGF signalling acts downstream of SAG and is required for accessory 
limb outgrowth. Dashed yellow lines demarcate ABs. The experimental 
data come from two experiments. dpw, days post-wounding. Scale bars, 

1 mm. b, Ectopic baculovirus-induced expression of FGF8 in ABs results 
in single digit accessory limb outgrowths. Percentage of substantial 
outgrowths from ABs transduced with fgf8 or mCherry (negative control). 
See Supplementary Table 4 for details. Short blue horizontal lines represent 
average percentage for each condition. The dotted line is for visual aid and 
intersects the y axis at of 0%. 
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Extended Data Figure 6 | FGF8, but not HH signalling, is sufficient 

to drive accessory limb outgrowth from PBs. a-c, f, HH signalling is 
not sufficient to drive accessory limb outgrowth from PBs (for details 

see Supplementary Table 1). a, SAG is sufficient to drive accessory 

limb outgrowth from ABs (n= 23 of 32) but not PBs (n= 1 of 32) on 

the contralateral limb of the same animal. Data from one experiment. 

The images are the same as in Fig. 2a. b, Untreated PBs also produce no 
accessory limb outgrowth (n= 1 of 37; data from two experiments). 

c, Positive control: PBs complemented with anterior skin transplants from 
animals that constitutively express GFP grew into accessory limbs without 
SAG treatment (n= 10 of 19, data from two experiments). f, Percentage 
of accessory limb outgrowths from PBs without treatment (H,O), treated 
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with SAG, or complemented with anterior skin. d, e, g, FGF8 is sufficient 
to drive accessory limb outgrowth from PBs (data from three experiments; 
for details see Supplementary Table 5). d, PBs expressing baculovirus- 
induced FGF8 grow into accessory limbs (n= 27 of 50). The images are the 
same as in Fig. 2b. e, Negative control: PBs expressing baculovirus-induced 
mCherry do not grow into accessory limbs (n = 3 of 49). g, Percentage 

of accessory limb outgrowths from PBs transduced with fgf8 or mCherry 
(negative control). Short blue horizontal lines in all graphs represent 
average percentage for each condition. Dotted lines are for visual aid and 
intersect the y axes at 0%. Dashed yellow lines demarcate blastemas. 

DF, dark field images; dpw, days post-wounding. Scale bars, 2mm. 
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Extended Data Figure 7 | PBs do not express fgf8 or fgf17 even though 
intact HH signalling is present, but do express grem1, fgf10 and fgf9. 

a, g, PBs do not express fgf8 and fgf17 even upon SAG treatment. Fgf8 
and fgf17 are not expressed in PBs at 8 or 12 days in the presence or 
absence of SAG, but are expressed in positive control PBs containing an 
anterior skin transplant. b-d, f, Early induction of grem1, shh, ptcl (a HH 
signalling target) and fgf10 in posterior wounds is independent of SAG, 
and SAG is not sufficient for maintenance of grem1 and shh expression. 

b, c, Grem1 and shh are present in PBs at 8 days and are downregulated 
by 12 days independent of SAG treatment. This observation differs 
slightly from the Nanostring nCounter data (Fig. 2d) where grem1 is 
downregulated at 15 days. The discrepancy is probably due to the variation 
in regeneration timelines across different experiments. Grem1 and shh are 


present in control PBs containing an anterior skin transplant at both 8 and 
12 days. d, Ptcl is expressed at 8 days in PBs that were untreated or treated 
with SAG or complemented with an anterior skin graft. f, Fgf10 is present 
in all conditions at 8 and 12 days. e, Fgf9 is expressed in a few cells at the 
tip of 8-day untreated PBs, but is absent from 12-day untreated PBs and 8- 
or 12-day SAG-treated PBs. Fgf9 is present in control PBs complemented 
with anterior skin grafts. Detailed information on the number of samples 
analysed and results can be found in Supplementary Table 3. Inserts show 
magnification of positive cells. Images of 12-day PBs complemented with 
anterior skin (e-g) show different sections from the same limb that were 
hybridized separately for each of the indicated genes. Scale bars: all 8-day 
samples, 100 1m; all 12-day samples, 200 1m. 
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Extended Data Figure 8 | Schematic illustration of the expression 
pattern of fgf8, fef9, fgf10, fef17, grem1 and shh in ABs and PBs 

treated with SAG or untreated and PBs complemented with anterior 
skin transplant. ABs upregulate fe/8, fgf9, fgf10 and grem1 but not 

shh in the absence of SAG at 8 days post-wounding, indicating that 

initial upregulation of fgf8, fgf9 and fgf10 (yellow) and grem1 (blue) is 
independent of SAG. ABs downregulate expression of fgf8 and grem1 by 
12 days, and eventually regress. ABs treated with SAG maintain fgf8 and 
grem1 expression at 12 days, indicating that HH signalling is necessary for 


Posterior wound 
+ anterior skin transplant 
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their maintenance. SAG-treated ABs also express fgf17. ABs treated with 
SAG eventually develop accessory limbs (ALs). Untreated PBs upregulate 
faf9, fgf10, grem1 and shh (red) but not fgf8 nor fgf17. SAG-treated 

PBs also do not upregulate fgf8 or fgf17 and do not produce accessory 
limb outgrowths, indicating that SHH signalling is not sufficient for 
accessory limb outgrowth from PBs. PBs complemented with an anterior 
skin transplant express fgf8, fef9, ff10, fefl7, grem1 and shh and form 
accessory limbs. 
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Extended Data Figure 9 | Expression patterns of gli3, hand2, fgf8, 

Sa f9, fef10, fef17, grem1, shh and hoxd13 in normal limb regeneration. 
a, Expression of Gli3, hand2, fgf8 and grem1 was analysed on adjacent 
sections of a 10-day blastema. Sections are arranged from the distal tip of 
the blastema (top, plane A) to the proximal base of the blastema (bottom, 
plane E). Gli3 is expressed throughout the blastema, with stronger 
expression in the anterior half that lacks hand2 expression and weaker 
expression in the hand2-expressing zone. Fgf8 is expressed in the anterior 


hand2 fgof9 fgf17 fgf10 


anterior 


distal 


Plane A 


proximal 


half that lacks hand2 expression. Grem1 is expressed more proximally than 
the other genes (starting from plane B). It is primarily expressed in the 
anterior half of the blastema, but is also expressed in the posterior half, 
particularly in the proximal part (planes D and E). b, Unlike fgf8; fgf9, 
fgfl0 and fgf17 are not excluded from the hand2-expressing, posterior half 
of a blastema (11 days). c, Shh is expressed in a small posterior domain in 
the blastema (12 days). The planes are sections of the same blastema along 
the proximo-distal axis. Scale bars, 200 1m. 
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Extended Data Figure 10 | HH signalling is required for FGF8-induced 
accessory limb outgrowth from PBs. a, Schematic outline of the 
experiment. Each replicate experiment consisted of transducing limbs with 
fgf8 or mCherry (negative control), followed by injury with nerve deviation 
at 10 or 14 days after transduction. Treatment of FGF8-expressing animals 
with 84M cyclopamine or an equivalent amount of ethanol (cyclopamine 
solvent) was started at 6 days post-wounding. Additionally a group of 
non-transduced PBs were complemented with anterior skin and treated 
with 8 uM cyclopamine (as controls for blocking HH signalling) or ethanol. 
b, Examples of typical outcomes in each condition. Images from the 
condition PB+-FGF8-+cyclopamine are the same as in Fig. 2c. Scale bars, 
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2mm. dpw, days post-wounding. c, Percentage of substantial outgrowths 
in each condition. Data from four experiments. For detailed description 
of the number of limbs in each condition and the way they were scored 
see Supplementary Table 6. The dotted line is for visual aid and intersects 
the y axis at 0%. d, The effect of cyclopamine is specific to HH signalling. 
Limbs were amputated and treatment was started 6 days post-amputation 
with 8 1M cyclopamine alone or in combination with 600 nM SAG. 
Treatment with cyclopamine blocked regeneration, which was rescued by 
co-treatment with SAG. Numbers indicate regenerated limbs out of total 
amputated limbs. Data from two experiments. dpa, days post-amputation. 
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Obesity and its associated comorbidities (for example, diabetes 
mellitus and hepatic steatosis) contribute to approximately 
2.5 million deaths annually’ and are among the most prevalent 
and challenging conditions confronting the medical profession”. 
Neurotensin (NT; also known as NTS), a 13-amino-acid peptide 
predominantly localized in specialized enteroendocrine cells of the 
small intestine* and released by fat ingestion’, facilitates fatty acid 
translocation in rat intestine®, and stimulates the growth of various 
cancers’. The effects of NT are mediated through three known NT 
receptors (NTRI, 2 and 3; also known as NTSR1, 2, and NTSR3, 
respectively)®. Increased fasting plasma levels of pro-NT (a stable 
NT precursor fragment produced in equimolar amounts relative to 
NT) are associated with increased risk of diabetes, cardiovascular 
disease and mortality’; however, a role for NT as a causative 
factor in these diseases is unknown. Here we show that NT- 
deficient mice demonstrate significantly reduced intestinal fat 
absorption and are protected from obesity, hepatic steatosis 
and insulin resistance associated with high fat consumption. 
We further demonstrate that NT attenuates the activation of 
AMP-activated protein kinase (AMPK) and stimulates fatty 
acid absorption in mice and in cultured intestinal cells, and that 
this occurs through a mechanism involving NTR1 and NTR3 
(also known as sortilin). Consistent with the findings in mice, 
expression of NT in Drosophila midgut enteroendocrine cells 
results in increased lipid accumulation in the midgut, fat body, and 
oenocytes (specialized hepatocyte-like cells) and decreased AMPK 
activation. Remarkably, in humans, we show that both obese and 
insulin-resistant subjects have elevated plasma concentrations of 
pro-NT, and in longitudinal studies among non-obese subjects, 
high levels of pro-NT denote a doubling of the risk of developing 
obesity later in life. Our findings directly link NT with increased 
fat absorption and obesity and suggest that NT may provide a 
prognostic marker of future obesity and a potential target for 
prevention and treatment. 

To examine the potential role of NT in fat deposition and obesity, 
we initially compared NT-deficient mice’™'! to wild-type littermates 
fed a standard diet (18% kcal from fat) for 6 months. Average body 
length, small intestine weight and length, villus height and crypt 
number did not differ between genotypes (Extended Data Fig. la-f). 
However, the epididymal and retroperitoneal fat pads of Nt" mice 
were consistently smaller as compared to wild type (Extended Data 


Fig. 2a—c). Indeed, when challenged with a high-fat diet (HFD; 60% 
kcal from fat), epididymal, retroperitoneal and pericardiac fat deposits 
were markedly smaller in Nt" mice (Extended Data Fig. 2d-f). 
Moreover, body weight was significantly less in both male and female 
Nt‘ mice fed a HFD (Fig. 1a, b) and male Nt" mice fed a low-fat 
diet (LFD; 10% kcal from fat) (Extended Data Fig. 2g) compared with 
wild type. No differences in body weight were noted between female 
N¢t*’* and Nt mice fed a LFD (Extended Data Fig. 2g). EchoMRI 
assessment of total fat (corrected for body weight) further con- 
firmed reduced body fat composition in male Nt" mice (Extended 
Data Fig. 2h). 

Obesity-associated insulin resistance was also attenuated in 
NT-deficient mice. On a HFD, NT-deficient mice demonstrated lower 
levels of fasting plasma glucose and insulin (Fig. 1c), greater insulin 
sensitivity and faster glucose clearance (Fig. 1d) compared with 
wild type. Consistently, insulin-stimulated phosphorylated (p)-Akt 
expression was decreased in the livers of Nt*/* mice fed a HFD, 
whereas Nt’/~ mice fed the same diet demonstrated a similar induc- 
tion of p-Akt after injection of insulin, as noted for both wild-type 
and NT-deficient mice fed a LFD (Extended Data Fig. 2i). In con- 
trast, there was no difference in insulin secretion when comparing 
N¢t*'* and Nt’ mice maintained on a normal chow diet following 
either glucose administration by gavage or refeeding after a 16 h fast 
(Extended Data Fig. 2j, k). Hepatic steatosis (Fig. le), liver triglyceride 
(TG) and cholesterol accumulation (Fig. 1f) were significantly 
decreased in Nt’/~ mice fed a HED. Adipocytes in epididymal fat pads 
of Nt/~ mice showed a reduction in size (Fig. 1g, h) and decreased 
inflammatory infiltrates and numbers of macrophages (F4/80- 
positive cells) (Extended Data Fig. 21, m). Together, these results 
indicate that NT deficiency protects against comorbidities (that is, 
increased insulin resistance and hepatic steatosis) associated with 
high dietary fat intake. 

NT has been linked to hypothalamic leptin signalling’ and is con- 
sidered to be an anorectic peptide based on acute suppression of food 
intake in rats after intracerebral or intraperitoneal (i.p.) administra- 
tion of NT’*!*. When considering total food intake over 22 weeks, 
there were no differences between genotypes in either male or female 
mice fed a LFD or males fed a HFD; however, a slight 10% decrease 
in female Nt mice fed a HFD reached significance (Extended Data 
Fig. 3a). Weekly food consumption was not statistically different in 
male or female mice fed a LFD or males fed a HFD (Extended Data 
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Figure 1 | Protective effects of NT deficiency on obesity and comorbid 
conditions. a, b, Representative male (a) and female (b) Nt*/* and 

Nt‘ mice fed a HFD for 22 weeks (top). Body weight (BW) was measured 
weekly (bottom) (male Nt*/* n= 18 and Nt’- n=17 mice; female 

Ne*’* n=15 and Nt" n=12 mice). Body weight slopes were compared: 
*P <0.05 Nt*'* versus Nt/~ mice. c, Plasma glucose (top) and insulin 
(bottom) levels quantified in 24-h-fasted male mice maintained ona 

LED or HED for 24 weeks (LFD N¢*/t n= 10, Nt/~ n=10; HED Ntt/* 
n= 8, Nt’ n=10). *P < 0.05 versus LED in Nt*’* and Nt" mice, 
respectively; +P < 0.05 versus HED in Ntt’* mice. d, Blood glucose (BG) 
during insulin tolerance test (ITT) (left; 7 = 5) and glucose tolerance test 
(GTT) (right; n = 3) in 6-h-fasted male mice fed a HFD for 24 weeks. 

* P< 0.05 versus Nf ~ mice. e, Gross, haematoxylin and eosin (H&E) and 
oil red O (ORO) imaging of livers of male mice fed a HFD for 24 weeks 
(n=5). Scale bar, 501m. f, Hepatic TG and cholesterol were analysed 

by liquid chromatography-mass spectrometry (LC-MS) in 24-h-fasted 
male mice fed a HED for 24 weeks (n =3). *P < 0.05 versus Nt*!* mice. 
g, H&E staining of epididymal fat from male N¢*/* and Nt/~ mice fed a 
LFD or HED for 24 weeks (n= 5). h, Quantitative analysis of adipocyte 
area (n = 3). *P <0.05 versus LED in Ntt’* and Nt’ mice, respectively; 
+P <0.05 versus LFD in N¢t*’* mice; +P <0.05 versus HED in Ntt!* mice. 
All data are mean + standard deviation (s.d.). Linear mixed model for a, b; 
analysis of variance (ANOVA) with Holm’s P-value adjustment for c, h; 
two-sided Student’s t-test for d, f (see Methods). 


Fig. 3b, c); only the week 9 comparison in females fed a HFD reached 
significance (Extended Data Fig. 3c). Energy expenditure, locomotor 
activity, energy intake and respiratory exchange ratios were not differ- 
ent between Ntt’+ and Ne mice fed either a HED or LED (Extended 
Data Fig. 3d—g). Although we cannot completely dismiss a potential 
effect of NT deficiency on feeding behaviour, this factor alone does 
not appear to play a major part in the lower weight gain observed in 
Nt“ mice fed a HFD. 

We next evaluated differences in intestinal lipid absorption as a pos- 
sible mechanism for the decreased weight gain noted in NT-deficient 
mice. Compared to N¢t*/* mice, faecal TG content was increased by 
~25% in Nt" mice fed a HFD (Fig. 2a), but was not associated with a 
change in stool colour, consistency or output (Extended Data Fig. 4a), 
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indicating that NT deficiency decreases lipid absorption without overt 
signs of fat malabsorption (that is, steatorrhoea). Consistently, fewer 
and smaller lipid droplets were noted in the mucosa of the proximal 
intestine of Nt mice at 30 and 60 min after olive oil gavage com- 
pared with N¢t*’* mice (Fig. 2b). To track and quantify intestinal 
absorption better, !*Cj-oleic acid ('°C,g-OA) was administered by 
gavage and measured in the proximal intestine and plasma by Fourier- 
transform-based mass spectrometry (FT-MS). '°Cs-OA was signif- 
icantly decreased in the intestine and at 2 and 3h in the plasma of 
Nt‘ mice (Fig. 2c). NT administration (3,600 nmol kg! body weight, 
i.p.)'* restored TG accumulation in Nt‘ mice to levels similar to 
those in Nt*’* mice (Fig. 2d). Similarly, pre-treatment of C57BL/6 
mice with SR 48692 (2.5mgkg™', i.p.), a selective non-peptide 
NTRI antagonist that acts peripherally and centrally when adminis- 
tered either i-p. or by gavage’”, decreased intestinal fatty acid absorp- 
tion after olive oil gavage (Fig. 2e). To demonstrate further that the 
effect of SR 48692 was due to disruption of NT signalling, we repeated 
the experiment using Nt~ mice and their wild-type littermates 
and found that, similar to our initial results, pre-treatment with SR 
48692 inhibited intestinal fatty acid absorption in wild-type mice (as 
measured by TG accumulation); however, SR 48692 pre-treatment 
in N¢’~ mice did not further decrease lipid absorption after olive 
oil administration, which, as expected, was significantly decreased 
compared to wild-type mice (Extended Data Fig. 4b). These results 
indicate that SR 48692 attenuation of fat uptake reflects the disruption 
of normal NT signalling and is not due to an unanticipated off-target 
effect. NT treatment also increased fatty acid uptake in rat intestinal 
epithelial-1 (RIE-1) cells (Extended Data Fig. 4c, d) that express NTR1 
and 3 (similar to human intestinal cells (FHs 74 Int)) and mouse 
intestinal mucosa (Extended Data Fig. 4e-g), and short interfering 
RNA (siRNA) knockdown of either NTRI or NTR3—the protein 
products of which have been demonstrated to heterodimerize on the 
cell surface and to broaden the response range for NT signalling!°— 
reduced NT-mediated fatty acid absorption (Extended Data Fig. 4h, i). 
Consistent with a role for NT in HFD-induced weight gain, treatment 
with SR 48692 (2.5 mg per kg body weight, oral gavage, twice a day) 
for 13 weeks significantly attenuated body weight gain in wild-type 
mice fed a HFD (Fig. 2f) without altering food intake (Extended Data 
Fig. 4j). Collectively, these results indicate that NT promotes intestinal 
lipid uptake through NTRI, and possibly NTR3, promoting weight gain 
in mice fed a HFD. Interestingly, it was recently demonstrated’° that 
NTR3-deficient mice, when placed on a HFD, exhibited a similar 
phenotype as NT-deficient mice, thus further emphasizing the poten- 
tial importance of the NT/NTR axis in weight gain associated with an 
overabundance of fat. 

AMPK, a key fuel-sensing enzyme and a critical regulator of 
metabolism, mediates the effects of a variety of hormones!”!®. 
Phosphorylated- AMPK (p-AMPK) was increased in the proximal 
intestinal mucosa of Nt’~ mice fed a standard diet compared with wild 
type (Extended Data Fig. 5a). p- AMPK expression was decreased in the 
proximal intestine of Nt*/* mice after olive oil gavage but was restored 
to control levels with SR 48692 pre-treatment (Extended Data Fig. 5b). 
Treatment of FHs 74 Int and RIE-1 cells with oleate led to an increase 
in p-AMPK (Extended Data Fig. 5c), which was reduced by NT 
pre-treatment (Fig. 2g and Extended Data Fig. 5d). Pharmacological 
activation of AMPK with 5-aminoimidazole-4-carboxamide-1-8-p- 
ribofuranoside (AICAR) further increased oleate-stimulated p-AMPK 
(Fig. 2h) and concomitantly decreased fatty acid absorption (Fig. 2i); 
these effects were blocked by NT pre-treatment. NT-mediated suppres- 
sion of p-AMPK was prevented by either NTR1 or NTR3 knockdown 
(Extended Data Fig. 5e, f). Knockdown of the upstream AMPK kinase 
Ca**t-calmodulin-dependent protein kinase kinase (CaMKK2)!*°, 
but not liver kinase B1 (LKB1)*’, decreased oleate-stimulated p-AMPK 
(Extended Data Fig. 5g). Moreover, overexpression of CaMKK2 
attenuated NT-mediated suppression of p- AMPK (Extended Data 
Fig. 5h). Together, these findings suggest that NT, acting through 
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Figure 2 | NT deficiency reduces intestinal lipid absorption. a, Faecal 
TG was analysed in male mice fed a HFD for 24 weeks (n= 3). *P=0.05 
versus N¢t*'t mice. b, ORO staining of proximal intestines from male 
mice fed normal chow with or without olive oil by oral gavage after 
overnight fasting (n = 3). Scale bar, 501m. c, Levels of '°C,g-OA in male 
mice fed normal chow and given °C;s-OA mixed in olive oil by gavage 
after an overnight fast were analysed by nanospray FT-MS. Left (n=5), 
*P < 0.05 versus Ntt’* mice. Right (n =4), *+P < 0.05 versus 0h in Ne*’t 
and Nt’ mice, respectively; +P < 0.05 versus 2h and 3h in Nt*’* mice. 

d, ORO staining of proximal intestines from male mice following gavage 
with saline, olive oil, or olive oil plus NT (3,600 nmol kg~! body weight, 
ip.) after an overnight fast (left; n =5). Scale bar, 501m. TG (in mg) 

was quantified in proximal intestines and normalized to the amount of 
protein (in mg) as described in Methods (right; n = 5). Graph presents 
the fold change versus saline in Ntt’/* mice. *P < 0.05 versus saline in 
Ntt’* and Net‘ mice, respectively; +P < 0.05 versus olive oil in Nt*/* mice; 
+P <0.05 versus olive oil in Nt" mice; §P < 0.05 versus olive oil in Ntt/*. 
e, Proximal intestines from mice given saline, olive oil, or olive oil plus SR 
48692 were collected and TG levels were quantified as described earlier 
(n= 8). Graph presents the fold change versus control mice. *P < 0.05 
versus control mice; +P < 0.05 versus mice with olive oil only. f, Weekly 
body weight was measured in male wild-type C57BL/6 mice fed a HFD 
and treated with SR 48692 (2.5 mg per kg body weight, diluted in distilled 


NTRI and/or NTR3, increases fatty acid absorption through suppres- 
sion of CaMKK2-mediated AMPK phosphorylation. 

Drosophila provide a powerful model system to understand molecular 
mechanisms regulating human metabolic disorders better. To estab- 
lish further the role of NT on intestinal lipid absorption and AMPK 


Table 1 | Pro-NT levels and association with clinical outcomes 


H,0 and administered by oral gavage twice a day) or vehicle (vehicle 

n= 12; SR 48692 n= 13). Body weight slopes were compared: *P < 0.05 
vehicle versus SR treatment. g, FHs 74 Int cells were pre-treated with 

or without NT at different doses as indicated for 30 min followed by 
combined treatment with oleate (0.1 mM) or bovine serum albumin (BSA) 
for 1h and western blotting of cell extracts (top); densitometric analysis 

of p-AMPK is from three separate experiments and normalized to total 
AMPK; graph demonstrates the fold change of p-AMPK versus control 
(BSA; bottom). *P < 0.05 versus BSA; +P < 0.05 versus oleate alone. h, FHs 
74 Int cells were treated with or without NT (2 1M) for 30 min followed by 
AICAR (1 mM) for 2h and oleate (0.1 mM) or BSA for 1h and analysed 
by western blot (top); p- AMPK levels were determined as in g from three 
separate experiments (bottom). *P < 0.05 versus control (BSA); +P < 0.05 
versus oleate alone; +P < 0.05 versus oleate plus AICAR. i, FHs 74 Int 

cells were treated with NT (2 11M) for 30 min followed by addition of 
AICAR (1 mM) for another 3 h; cells were then incubated with BSA or 
4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene-3-hexadecanoic 
acid Cys (BODIPY FL Cj¢) for 15 min and images were taken by confocal 
microscopy (original magnification, x 180). Representative images are 
from three experiments. All data are mean + s.d. Two-sided Student’s 
t-test for a, c (left); ANOVA with Holm’s P-value adjustment for c (right), 
d (right), e, g (bottom), h (bottom); linear mixed model for f (see Methods). 
(See Supplementary Fig. 1 for gel source data.) 


regulation, human full-length NT complementary DNA (cDNA) 
was expressed in Drosophila midgut enteroendocrine cells using the 
enteroendocrine-cell-specific driver Gr36C-Gal4 (ref. 22) (see colo- 
calization with enteroendocrine-cell-specific transcription factor 
Prospero”*4; Extended Data Fig. 6a, b); mature NT'~’° peptide was 


N total/N cases* Odds ratio (95% Cl) P for trend? 
Pro-NT quartile 17 Pro-NT quartile 2 Pro-NT quartile 3 Pro-NT quartile 4 
Prevalent obesity 4,626/604 1.0 (ref) 1.00 (0.78-1.29) 1.13 (0.88-1.45) 1.34 (1.05-1.70) 0.01 
Prevalent abdominal obesity 4,625/1,769 1.0 (ref) 1.07 (0.90-1.27) 1.23 (1.04-1.46) 1.30 (1.09-1.54) 0.001 
Prevalent insulin resistance 4,468/1,140 1.0 (ref) 1.30 (1.06-1.59) 1.43 (1.17-1.74) 1.70 (1.39-2.06) <0.0001 
New-onset obesity 2,594/333 1.0 (ref) 1.41 (0.95-2.10) 1.79 (1.21-2.65) 2.05 (1.38-3.06) <0.0001 


ref, reference group. 


“Denotes total number of subjects in the analysis divided by the number of cases with listed obesity or insulin resistance phenotype. *Pro-NT quartiles 1-4 define the MDC-CC population quartiles 
(lowest to highest) with median (range) baseline fasting plasma pro-NT concentrations (pmol I~!) of 60.1 (3.3-75.9), 89.3 (75.9-105), 123 (105-149) and 190 (149-1,155), respectively. Data are 
presented as odds ratios (95% Cls), and subjects belonging to the lowest quartile of pro-NT were defined as the reference group (odds ratio = 1). *Denotes the P value for linear trend over quartiles 1-4. 
Logistic regression models were used for comparisons between quartiles and linear trends over quartiles (see Methods). 
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Figure 3 | NT suppresses AMPK activation and promotes lipid 
accumulation in Drosophila. a, Midguts from 7-day-old adult flies 
expressing either Gr36C-w1118 (control, 100%, n =7; the percentage 

used here and in subsequent analyses indicates the percentage of organs 
exhibiting the phenotype) or Gr36C-NT (93%, n= 15) were stained 

with BODIPY. Scale bar, 100|1m. b, Oenocytes on the basal surface 

of the lateral epidermis of 3rd instar larvae expressing Gr36C-w1118 
(100%, n= 15) or Gr36C-NT (93%, n = 15) were stained with BODIPY 

to monitor lipid accumulation (green) and the anti- HNF4 antibody to 
mark the oenocytes (red). Scale bar, 50 um. c, Fat bodies attached to the 
salivary gland from the 3rd instar larvae expressing Gr36C-w1118 (100%, 
n= 14) or Gr36C-NT (100%, n =8) were stained with BODIPY. Scale bar, 
50m. d, Flies expressing Gr36C-w1118 or Gr36C-NT (n=5) were fed 
either SDF or HFD, and guts were stained with Nile Red to examine the 
accumulation of lipids (arrows). Scale bar, 100 |1m. e, NT was expressed by 
voila-Gal4 and TG (in mg) was measured and normalized to body weight 
(in mg) in male adult flies fed either SDF or HFD (n= 3). voila-w1118 
served as control. *P < 0.05 versus SDF in control- and NT-expressing 
flies, respectively; {P< 0.05 versus SDF in control flies; #P < 0.05 versus 
HED in control flies. f, Western blotting was performed to monitor the 
levels of AMPK in adult midgut shown in a. g, Myo1A-Gal4 combined 
with tub-Gal80" (Myo*) does not express active Gal4 at 20°C permissive 
temperature. Shown on the left is the midgut from a 7-day-old adult 
expressing Myo’-AMPK raised at 20 °C, stained with BODIPY, and used as 
a control (100%, n= 11). Midguts expressing AMPK (middle; 94%, n= 16) 
or AMPKRNAi**?*! (right; 100%, n = 12) from 7-day-old adults at 29 °C 
(active Gal4) were stained with BODIPY. Scale bar, 100 1m. The levels of 
AMPK were monitored by western blot. All data are mean + s.d. ANOVA 
with Holm’s P-value adjustment for e (see Methods). (See Supplementary 
Fig. 1 for gel source data.) 


detected in larval gut (3.1 fg per gut) and Drosophila S2 cells trans- 
fected with NT cDNA (Extended Data Fig. 7). Compared to control, 
NT expression markedly increased lipid droplets in the midgut of 
7-day-old adult (Fig. 3a) and larvae (Extended Data Fig. 6c) fed a 
standard diet, and also increased the accumulation of lipid droplets 
in oenocytes (Fig. 3b) and fat bodies (Fig. 3c) of Gr36C-NT 3rd instar 
larvae. Next, we used a diet-induced obesity model in Drosophila” 

demonstrate increased lipid droplets in the midgut of control flies fed 
a HFD compared with flies fed standard food (SDF). NT expression 
dramatically increased lipid accumulation in the midgut (Fig. 3d) and 
total body TG levels (Fig. 3e) with either a SDF or HED, suggesting 
that NT promotes efficient lipid absorption, and that this effect is 
further enhanced by increased fat concentration. Consistently, we 
also found that NT expression decreased gut p-AMPK levels in both 
adults and larvae (Fig. 3f and Extended Data Fig. 6d). These findings 
suggest that, similar to mice, the effects of NT on lipid absorption 
are mediated, in part, through AMPK regulation. Indeed, we found 
that AMPK overexpression decreased, whereas AMPK knockdown 
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increased lipid droplets in the midgut of 7-day-old adults (Fig. 3g) 
and the midgut, fat body and oenocytes of larvae (Extended Data 
Fig. 6e-g). 

To identify the endogenous Drosophila NTR, we carried out a 
targeted RNA interference (RNAi) screen in S2 cells; expression of 
NT or treatment with NT peptide consistently decreased p-AMPK 
(Extended Data Fig. 8a, b). Among the three potential Drosophila 
NTRs?°*’, RNAi of CG9918 (Pyrokinin 1 receptor (PK1-R)), but 
not CG8784 or CG8795, blocked the decrease in p- AMPK levels in 
NT-expressing cells (Extended Data Fig. 8a). The effect of NT does 
not appear to be due to interference with PK-1 signalling, since PK-1 
(encoded by Capability) knockdown did not alter p-AMPK levels 
(Extended Data Fig. 8d). Knockdown of CG9918 expression in entero- 
cytes using Myo1A-Gal4 to drive UAS-CG9918 RNAi (TRiP 27539) 
expression in flies also expressing NT driven by the enteroendocrine- 
cell-specific tachykinin (TK) promoter*®”? markedly attenuated 
NT-induced lipid droplet accumulation (Extended Data Fig. 8c). These 
results indicate that NT increases lipid accumulation, at least in part, 
through CG9918, an NTR-like receptor that shares 32% identity and 
50% similarity with mouse NTRI1 (Extended Data Fig. 8e), suggest- 
ing an evolutionarily conserved function for NTR signalling in lipid 
uptake. 

The mouse and Drosophila data prompted us to assess the pos- 
sible role of NT in the development of obesity and its metabolic 
complications in humans. Fasting plasma concentrations of pro-NT 
were analysed from 4,632 middle-aged subjects of the population- 
based Malmo Diet and Cancer Study Cardiovascular Cohort 
(MDC-CC)? (Extended Data Table 1). The age- and sex-adjusted 
likelihood of being obese, abdominally obese and insulin resistant 
significantly increased across quartiles of pro-NT plasma levels 
(P=0.01, 0.001 and <0.0001, respectively; Table 1). Continuous 
values of pro-NT were also significantly related to continuous values 
of body mass index (BMI), waist circumference and homeostasis 
model assessment of insulin resistance (Extended Data Table 2). 
Among non-obese subjects, the risk of developing obesity during an 
average follow-up time of 16.5 + 1.5 years increased gradually with 
pro-NT quartiles, independently of baseline BMI, age and gender 
(P < 0.0001; Table 1). Importantly, non-obese subjects in the top 
quartile of baseline pro-NT levels had greater than double the risk 
of developing obesity compared to those in the lowest quartile (odds 
ratio = 2.05, 95% confidence interval (CI): 1.38—3.06). Whereas the 
cross-sectional relationship between pro-NT and obesity became 
non-significant after additional adjustment for insulin resistance, 
the prospective relationship between pro-NT and risk of new-onset 
obesity remained highly significant (P = 0.001) after adjustment 
for insulin resistance (data not shown). Thus, pro-NT levels 
strongly predict new onset obesity in a graded manner, which is 
independent of baseline BMI and insulin resistance. These findings 
in adults warrant further prospective studies to evaluate whether 
pro-NT levels can be used to predict future obesity in children and 
adolescents. 

Our findings demonstrate a critical role of NT in HFD-induced obe- 
sity that involves decreased AMPK activation and increased intestinal 
lipid absorption. Moreover, we identify increased pro-NT levels as a 
strong risk factor of human obesity. From an evolutionary perspective, 
metabolically ‘thrifty’ genes, such as NT, are highly beneficial to ensure 
the efficient absorption of all ingested fats, but with the abundance 
of fats in typical Western diets, NT can have a detrimental effect by 
contributing to increased fat storage, obesity and related metabolic 
disorders. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Reagents. Phospho-AMPKa (Thr172) (2535), AMPKa (2532), LKB1 (3050), 
phospho-Akt (Ser-473) (4058) and pan-Akt (4691) antibodies were from Cell 
Signaling Technology (Danvers, MA). NTRI antibody (sc-374492) was from Santa 
Cruz Biotechnology (Dallas, TX). CaAMKK2 (ab168818), NTR3 (ab16640), and 
F4/80 (ab100790) antibodies were from Abcam (Cambridge, MA). Flag (F1804) 
and B-actin (A5316) antibodies were from Sigma-Aldrich (St Louis, MO). AICAR 
was from Cayman (Ann Arbor, MI). Oleate sodium, NTj_;3, glucose and human 
insulin were from Sigma. Deuterated oleic acid (CLM-460-PK) was obtained from 
Cambridge Isotope Laboratories (Tewksbury, MA). SR 48692 was from Tocris 
(Minneapolis, MN). Lipofectamine RNAiMAX and LTX Reagent with PLUS 
transfection reagents and Trizol were from Life Technologies (Grand Island, NY). 
pSG5-Flag-CaMKKz2 rat FL wasa gift from A. Means*? (Addgene plasmid #32449, 
Cambridge, MA). Primers for RT-PCR were from Integrated DNA Technologies 
(Coralville, IA). 

siRNA and sequences. ON-TARGETplus SMARTpool and ON-TARGETplus 
Non-targeting Control Pool siRNAs were purchased from GE Dharmacon 
(Lafayette, CO). The sequences are as below. Human LKB1: (1) UGACUGUGG 
UGCCGUACUY, (2) GCUCUUACGGCAAGGUGAA, (3) UGGAAAGGGAUGCU 
UGAGUA, and (4) GAAGAAGGAAAUUCAACUA. Human CAMKK2: 
(1) GUGAAGACCAUGAUACGUA, (2) GGAUCGUGGUGCCGGAAAU, 
(3) GAUCAAAGGCAUCGAGUAC, and (4) ACAGUAAGAUCAAGAGUCA. 
Human NTRI: (1) GGACUCCGUUCCUCUAUGA, (2) GCAACACGGUGAC 
GGCGUY, (3) GAGCACAGCACAUUCAGCA, and (4) GAACACCGACAU 
CUACUCC. Human NTR3 (also known as SORT1): (1) GAGACUAUGUUGUG 
ACCAA, (2) GAGCUAGGUCCAUGAAVAY, (3) GAAGGACUAUACCAUAUGG, 
and (4) GAAUUUGGCAUGGCUAUUG. Rat Nér1: (1) GGGCACACACAA 
CGGUUUA, (2) CCGAAAUGGAAGCGACGUY, (3) CUACGUUCCUCUUCG 
AUUU, and (4) GCUACUAUUUCCUGCGUGA. Rat Ntr3 (also known as Sort1): 
(1) ACAAAUGGGUACCGGAAAA, (2) GAACACAGCAACCGUCCUA, 
(3) AAGACAUCCUUGAGCGCAA, and (4) AAGCAGAAUUCCAAGUCGA. 
Mice. All procedures were approved by the Institutional Animal Care and Use 
Committee of the University of Kentucky. Nf" mice and their wild-type littermates 
(Ntt’*) were bred from Nt*’~ mice and randomly grouped for all experiments. 
Mice were maintained with a 14h light/10h dark cycle and provided with food 
and water ad libitum. For diet-induced obesity studies, male and female Nt*/* 
and Nt’~ mice were placed on a 60% HFD or 10% LFD (catalogue no. D12492 
and D12450B, respectively; Research Diets, New Brunswick, NJ) at weaning for 
22-24 weeks. Body weight (and food intake were measured weekly. Food intake 
was measured for each cage (3-5 mice per cage) and divided by mouse number to 
obtain total grams consumed per mouse per week. All mice used were 4-6 months 
old unless otherwise indicated. For chronic SR 48692 treatment on HFD-fed 
mice, male C57BL/6 mice (2 months old) were obtained from Taconic. After 
1 week acclimation, mice were placed in individual cages, started on HFD, and after 
1 week were divided into two groups, one of which received SR 48692 (dissolved 
in sterile distilled HO by brief sonication, 2.5 mg per kg body weight*!*? and the 
other vehicle twice a day (8 a.m. and 8 p.m.) by gavage. Body weight and food intake 
were measured weekly. 

Human intestinal samples. Surgical samples of duodenum and colon were 
obtained from de-identified donors through the Markey Cancer Center 
Biospecimen and Tissue Procurement Shared Resource Facility. All samples 
were obtained after informed consent according to a protocol approved by the 
Institutional Review Board of the University of Kentucky Medical Center (UKMC). 
Tissues were processed within 1h after resection; the mucosal layer was sharply 
dissected from the underlying seromuscular layer and collected in multiple 3-5 mm 
sections for western blotting. 

Cell lines, transfection and treatment. FHs 74 Int human small intestinal epi- 
thelial cells** were purchased from ATCC (Manassas, VA) and maintained in 
Hybri-Care Medium ATCC 46-X supplemented with 30ng ml7’ epidermal 
growth factor (Sigma-Aldrich) and 10% FBS. RIE-1 rat intestinal epithelial cells*° 
were maintained in DMEM containing 2mM t-glutamine, 4,500 mg1~! glucose 
and 10% FBS. FHs 74 Int cells were tested for authentication via STR profiling in 
April 2015 by Genetica DNA Laboratories (LabCorp Speciality Testing Group; 
Burlington, NC) using the commercially available PowerPlex 16HS amplifica- 
tion kit (Promega Corporation) and GeneMapper ID v.3.2.1 software (Applied 
Biosystems). Authentication was confirmed by a 100% match in comparison to 
the reference STR profile from ATCC (FHs 74 Int; ATCC CCL-241). The cell lines 
are not listed in the International Cell Line Authentication Committee (ICLAC) 
database. In addition, both cell lines were tested for mycoplasma contamination 
via PCR (e-Myco Plus kit; iNtRON Biotechnology) and were found to be negative. 
Reverse transfection was performed using RNAiMAX (for siRNA) or LTX with 
PLUS (for plasmid) transfection reagents. Final siRNA concentrations were used 
at 20 (NTR3), 40 (LKB1, CAMKK2) or 100 (NTR1) nM. Cells were treated 72h 


(siRNA) or 48h (plasmid) after transfection. For combined treatment of oleate 
with NT, cells were pre-treated with BSA or NT (21M) or NT at various dosages 
in serum-free media as indicated in the figures for 30 min followed by oleate 
(0.1 mM) for 1h or overnight. For treatment with oleate alone, cells were treated 
with BSA or oleate at different concentrations as indicated in the figures for 1h 
in serum-free media. 

Western blotting. Tissues and cells were lysed with lysis buffer (Cell Signaling 
Technology), and equal amounts of protein were resolved on 4-12% NuPAGE 
BisTris gels (Life Technologies), electrophoretically transferred to polyvinylidene 
difluoride (PVDF) membranes, and western blotting was performed as previously 
described*®*”, p-AMPK expression was analysed by densitometry and normalized 
to total AMPK expression using NIH Image] software from three separate exper- 
iments. Data are presented as fold change. 

Histology and immunohistochemistry. Tissues were fixed in 10% neutral- 
buffered formalin, embedded in paraffin, and sectioned (51m). H&E staining was 
performed using standard techniques. For H&E and oil red O (ORO) staining, 
liver tissues were collected from male N¢*/* and Nt" mice fed a HED for 24 weeks 
after weaning. For ORO staining, liver tissues were fixed in 10% neutral-buffered 
formalin, equilibrated in 30% sucrose, embedded in OCT compound and 
snap-frozen in liquid N. Frozen sections were stained with ORO (Sigma-Aldrich) 
for lipid deposition using standard methods. Adipocyte size was measured as 
described previously**. Briefly, sections of epididymal adipose tissue from each 
mouse were photographed under x 100 magnification. In a square measuring 
700 x 700\1m (x and y axis, respectively), adipocyte size and number were meas- 
ured using NIS Elements BR.3.10 software. A criterion for inclusion of meas- 
urements was a circularity of adipocytes superior to 0.33 (shape of cells; from 
0 (thin shape) to 1 (perfect circle)). Immunohistochemistry was performed and 
visualized by Dako EnVision Systems (Burlington, Ontario, Canada) following 
the product instruction. 

RT-PCR. Total RNA was isolated from cells using RNeasy Kit (Qiagen, Valencia, 
CA) according to the manufacturer’s instructions. RT-PCR analysis of NTR1, 2 
and 3 was performed using cDNA synthesized from 1 jg of total RNA. B-Actin 
was used as the internal control. The primers included: human NTRI: 
5'-TCATCGCCT TTGTGGTCTGCT-3’ and 5’-TGGTTGCTGGACACGCT 
GTCG-3’; human NTR2: 5'- GTCTCCTCAGCTTCATCGTAT-3’ and 5’-TCCCC 
AAAGCCTGAAGCTGTA-3’; human NTR3: 5’‘-AGAATGGTCGAGACTAT 
GTTG-3’ and 5‘-AAGAGCTATTCCAAGAGGTCC-3’; rat Ntr1: 5‘-GAGAAG 
CCCCCAAAATTCTC-3’ and 5’-CAAGGACCCAGTGCAGGTAT-3’; rat Ntr2: 
5’-ACTCGCTCATCTTCGCATTT-3/ and 5’-TGGGACCACACGAAGTTGTA-3’; 
rat Ntr3: 5'‘-TTTCAAGCTGTGCTTTGTGG-3’ and 5’‘-AGTTCTCTGA 
ACGGGAGCAA-3’. B-Actin: 5‘-TCACCAACTGGGACGACATG-3’ and 
5’-ACCGGAGTCCATCACGATG-3’. The PCR products were analysed on a 2% 
agarose gel. 

Glucose and insulin tolerance tests. Insulin (ITT) and glucose (GTT) tolerance 
tests were performed on 6-h-fasted male N¢*/* and Nr/~ mice fed HED for 24 weeks 
after weaning. Glucose values were measured using One Touch Ultra from LifeScan 
(Wayne, PA) by tail snip. Glucose (1.5 g per kg body weight) and human insulin 
(0.75 U per kg body weight) were injected intraperitoneally (i-p.) after baseline 
glucose levels were established in each mouse, and blood glucose levels were measured 
15, 30, 60 and 120 min after injection. 

Metabolic studies. Whole-body composition parameters were measured in 
male N¢t’* and Nt‘~ mice fed either a LFD or HFD for 24 weeks after weaning 
by EchoMRI-100 Whole Body Composition (Echo Medical System, Houston, 
TX) using magnetic resonance relaxometry to precisely measure total body fat, 
lean mass, body fluids and total body water in conscious mice. A TSE LabMaster 
indirect calorimetry system (TSE-Systems, Chesterfield, MO) was used to simul- 
taneously quantify energy expenditure, energy intake, locomotor activity and 
respiratory exchange ratio (RER). Mice were acclimated to the chambers for 
7 days, to permit recovery from the weight loss initially experienced by obese 
mice. Recordings were performed for 5 days, yielding three full 24-h periods of 
data. Feeding and activity data were collected continuously; O2 and CO; levels 
for energy expenditure and RER calculations were collected at 30-min intervals. 
Resting energy expenditure values were calculated from data collected between 
9 a.m. and 6 p.m., filtered to remove points at which activity was greater than 
150 counts for that interval. 

For hepatic TG and cholesterol measurements, liver tissues were collected 
from male N¢t*/* and Nt" mice fed a HFD for 24 weeks after weaning; TG and 
cholesterol were extracted as described previously*? and analysed by LC-MS 
coupled with electrospray ionization tandem using stable isotope dilution” 
performed on AB Sciex 4000 Q-Trap instruments. Fasting plasma glucose was 
analysed using a Glucose Colorimetric Assay Kit II (BioVision) and insulin using 
a Mouse Insulin ELISA kit (Mercodia, Winston Salem, NC). To monitor insulin 
secretion in response to glucose stimulation, plasma insulin and glucose levels 
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were measured in mice (fed standard chow after weaning) that had been fasted 
overnight and subsequently given glucose (2 g per kg body weight) by gavage 
(that is, oral GIT). Blood was collected from tail snip before (0 min) or 15, 30, 
60 and 120 min after gavage. Insulin and glucose levels were measured using 
Glucose Colorimetric Assay Kit II and Mouse Insulin ELISA kit as described 
earlier. Plasma insulin levels were also determined in mice that were either fasted 
for 16h or fasted for 16h with refeeding for 4h; blood was collected from the IVC. 
For faecal TG assay, mice were housed in individual cages for 4 days, day-4 faecal 
samples were dried and ground, lipid was extracted from 50 mg faeces, and TG 
was measured by LC-MS. 

Body length measurements. Anal-nasal length was measured following isoflurane 
anaesthesia of male and female Nt*/+ and Nt“ mice (7 months old). 

Faecal weight measurements. Male N¢*/* and Nt~ mice fed either normal chow 
(NC) or a HFD for 24 weeks after weaning were separated into individual cages 
and faecal pellets were collected and weighed daily for 4 days. Faecal weights from 
day 4 were averaged for wild-type and NT-deficient mice. 

Small intestinal characterization. The entire small intestine (SI), from the gastric 
pylorus to the ileocecal valve was dissected from anaesthetized male Nt*’* and 
Nt‘ mice (7 months old). The SI length was measured and then opened longitu- 
dinally, washed in cold saline to clear the luminal contents, dried briefly on a paper 
towel and weighed. The SI was divided 5 cm distal from the pyloric junction and 
the jejunoileum was divided into equal proximal and distal fragments. Proximal 
fragments were fixed in 10% neutral-buffered formalin for 24h, and ‘Swiss rolls’ 
were sectioned (5\1m) for H&E staining. H&E-stained sections were imaged on 
an Aperio ScanScope XT slide scanner at x20. Villus height and crypt counts were 
determined using Aperio ImageScope v.11.2.0.780 software. Crypts in a 1-mm field 
were counted; 10 fields were analysed per section. Villus height was measured from 
10 well-oriented villi on each slide. 

Lipid absorption studies. For olive oil administration and ORO analysis, male 
Nett’* and Nt’ mice on NC were fasted overnight, fed olive oil (17 1l per g body 
weight) by gavage, and killed either before (control), or 30 min and 60 min after 
gavage. The intestine was resected from the ligament of Treitz to the ileocecal junc- 
tion, divided into proximal, middle, and distal segments of equal length, washed 
with cold saline, and the proximal segment was processed for frozen sectioning 
and ORO staining using standard protocols. 

For °Cjs-OA experiments, male Nt*/* and Nt’~ mice on NC were fasted 
overnight, fed 1°C;s-OA (480 1g per g body weight) mixed in olive oil (10 per 
g body weight) by gavage, killed either 0 or 30 min later, and proximal intestinal 
segments were collected as described earlier. To detect plasma level of °C js-OA, 
mice were given '°C,s-OA as described earlier and blood was collected from tail 
snip either before (Oh), or 1, 2 and 3h after gavage. Lipid was extracted from 
proximal intestines and plasma” and !3C;s-OA levels were determined by direct 
infusion nanospray FT-MS (Orbitrap Fusion Tribrid Mass Spectrometer, Thermo 
Scientific, Waltham, MA) using a modification of a previous method”. Briefly, ali- 
quots of the lipid extracts were diluted 20-fold in isopropanol:chloroform:methanol 
(4:2:1), introduced into the TriVersa NanoMate (Advion, Ithaca, NY) and analysed 
on the Orbitrap Fusion in negative-ion mode to estimate the appropriate amount 
of d34-OA to spike. Lipid extracts were then spiked with a known concentration 
of d34-OA and re-analysed. The nanospray conditions on the Nanomate were as 
follows: 15 il of sample injection, 16 min of delivery time, 0.4 psi of gas pressure, 
and 1.5kV of negative applied voltage. The Orbitrap Fusion MS analysis conditions 
were as follows: mass resolution setting of 450,000 with lock mass using internal 
calibrant, scan range of m/z 150-1,600, percentage S-Lens RF Level of 60, 1.0 x 10° 
for AGC Target, 100 ms maximum injection time, and 10 averaged microscans. 
The ion transfer tube temperature was 275°C, and the experimental mass accuracy 
was + ~1 p.p.m. The unlabelled, '*C-labelled, and d34-OA were assigned based on 
their respective accurate mass of 281.24860, 299.30899, and 314.45574; unlabelled 
and !3C-labelled OA were then quantified against the internal d34-OA standard 
using their respective peak intensity. 

For NT ‘rescue’ experiments, male Nt*/* and Nr/~ mice on NC were fasted 
overnight, fed saline or olive oil (17 ul per g body weight) by gavage, injected i.p. 
with saline or NT (3,600 nmol per kg body weight), and, after being killed 60 min 
later, proximal intestinal fragments were dissected as described earlier and further 
divided into proximal and distal halves that were either processed for ORO staining 
or used for TG quantification, respectively, as described previously with modifi- 
cations“. Briefly, intestinal tissues were homogenized in 200 l of 0.5% Tween-20 
in PBS, heated for 5 min at 70°C, homogenates were microcentrifuged (3 min), 
and TG measurements were performed on the supernatants using a Triglyceride 
Determination Kit (Sigma-Aldrich)“ according to the manufacturer’s instructions. 

For in vitro fatty acid absorption or lipid accumulation studies, FHs 74 Int and 
RIE-1 cells (in serum-free medium) were incubated with either bovine serum 
albumin (BSA; fatty-acid-free) (Sigma-Aldrich) or BSA-conjugated BODIPY FL 
Ci6 (4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene-3-hexadecanoic 
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acid) (Life Technologies) (51M) for 15 min and images were obtained using an 
FV1000 Olympus confocal microscope (Olympus, Tokyo, Japan) using a x60, 
1.35 numerical aperture (NA) oil objective, zoom 3 and analysed with Olympus 
FV10-ASW v.3.1b software. Florescence intensity was determined on a total of 30 
cells from 3 fields (10 cells per field). Lipid accumulation, RIE-1 cells were pre- 
treated with or without NT for 30 min followed by combination treatment of BSA 
or oleate (0.1 mM) with or without NT overnight. Cells were homogenized and TG 
measured using Triglyceride Determination Kit as described earlier. Experiments 
were repeated at least three times. 

For acute SR 48692 treatment, one experiment was performed on male wild- 
type C57BL/6 mice obtained from Taconic (Hudson, NY). Mice were acclimated 
for 1 week, randomly divided into control, olive oil, and olive oil plus SR 48692 
groups and fasted overnight. SR 48692 was dissolved in dimethylsulfoxide (DMSO; 
50mgml') and diluted 1:100 in saline just before use. Mice were injected i.p. with 
SR 48692 (2.5 mg per kg body weight)*+*? or vehicle 30 min before oral gavage of 
olive oil (10 ul per g body weight), killed after an additional 30 min, and the proxi- 
mal intestine (see earlier) was used to determine TG content and for preparation of 
lysates for western blotting. This experiment was repeated in male Nt*/* and Ne 
mice (7 months old), and proximal intestinal TG was measured as described earlier. 
Drosophila studies. Full-length human NT cDNA was inserted into an attB- UAST 
backbone* and transgenic lines were generated using an attP2 locus and PhiC31 
integration system. UAS-AMPK and UAS-AMPK-RNAi (v106200) lines have been 
described***’, UAS-AMPK-RNAi line (TRiP #25931, Bloomington Stock Center) 
produced similar phenotypes to UAS-AMPK-RNAi (v106200) and was used for 
most experiments. The UAS-CG9918-RNAi line (TRiP #27539, Bloomington Stock 
Center) has been described*®. Gr36C-Gal4, Myo1A‘-Gal4, voila-Gal4, and $106- 
Gal4 have been described’? *! (FlyBase). Myo1 A-Gal4 is specifically expressed 
in enterocytes. To constitutively express NT in gut enteroendocrine cells, a gut 
enteroendocrine-cell-specific TK promoter (2.0 kb)”*”? was cloned into attB- UAST 
lacking Gal4-binding sites followed by insertion of full-length NT cDNA and the 
resulting plasmid (TK-NT) was used to generate a transgenic line by insertion 
at the VK5 attP locus. To express NT in enteroendocrine cells and knock down 
CG9918 by RNAi in enterocytes, the following genotype was used: yw; Myo1A- 
Gal4/CG9918RNAi’”*’; TK-NT/ +. tub-Gal80* was used in combination with the 
Myo1A-Gal4 (resulting in Myo’) to suppress Myo1A-Gal4-driven expression’, 
until shifting larvae to 29°C (non-permissive temperature) 96h after egg laying 
(AEL) or adult emerging. AMPK or AMPK RNAi were expressed in the fat body 
using the RU486-inducible GeneSwitch $106-Gal4 line. $106-AMPK or S106- 
AMPK-RNAi larvae (3rd instar) were grown on RU486- (200\.M, Mifepristone, 
Sigma-Aldrich) or vehicle-containing food and lipid droplet accumulation in fat 
body and oenocytes was examined by BODIPY (Invitrogen) and immunohisto- 
chemical (anti- HNF4 oenocyte marker>’) staining. Midguts (3rd instar larvae, 
adult) were dissected, fixed in 4% formaldehyde in PBS for 20 min, permeabilized 
in 0.1% Triton X-100 in PBS, and processed for either immunohistochemistry 
using mouse anti-Pros (DSHB, MR1A) and rabbit anti- NT (Abcam, ab43833) 
primary antibodies or lipid staining using BODIPY, Nile Red, or ORO (larvae). 

AMPK and p-AMPK were detected by western blot analysis using protein 
extracts from gastrointestinal tract (15 per sample) using anti-p-AMPK (Cell 
Signaling, #2535), anti-AMPK (Abcam, ab80039) and anti-$-tubulin control 
(DSHB, E7) primary antibodies and ECL as described. 

Adult flies were fed a HFD or standard food (SDF, cornmeal yeast) as 
described”». Briefly, flies (5 days after emerging) were collected and maintained 
on SDF for an additional 5 days, split into two groups, and fed either SDF or 
SDF containing 20% (w/v) coconut oil (HFD) for a further 5 days, followed by 
dissection and lipid staining using BODIPY or Nile Red. voila-NT and voila- 
Gal4-w1118 control flies were fed a HFD or SDF as described earlier for TG meas- 
urement (25 flies per sample) as described**. Fluorescence signals were acquired 
on an Olympus confocal microscope and images processed with Olympus 
FV10-ASW v.3.1b. 

Drosophila S2 cells** were transfected with ub-Gal4, UAST-NT, and individ- 
ual dsRNAs”» to inhibit the expression of NTR-like receptors CG9918 (nucleo- 
tides 11-570), CG8784 (nucleotides 72-478), CG8795 (nucleotides 203-343), 
Capa (nucleotides 1-455) or GFP control (nucleotides 6-606) using Effectene 
(Qiagen). Cells (6 x 10°) were dissolved in lysis buffer (450 ul) 48h post- 
transfection, microcentrifuged (12,000 r.p.m., 10 min), and supernatants were 
analysed by western blotting. For quantitative RT-PCR, cDNA was synthe- 
sized (SuperScript III First Strand Synthesis, Invitrogen) from 1.0\g Trizol- 
extracted RNA and used for quantitative RT-PCR (95 °C, 30s followed by 
40 cycles, 95°C, 5s; 55°C, 30s; 72°C, 15s) using SYBR Green (Thermo-Fisher) 
and the AC; method (StepOnePlus Real-Time PCR System, Applied Biosystems). 
Primer pairs were as follows: CG9918, 5‘-GAGTTTCAACGGCGGAGGAA-3’, 
5’-AGCAGAGGAAGAAGCACACC-3/; CG8784, 5‘-GGCGTGCTGGGTAATCT 
TAT-3’, 5'/-CAAAGGTTGTACAGCTCCTG-3’; CG8795, 5’-GCTACGCCCTC 
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ATATTTATC-3’, 5'-GAGGTTATAGAGGTCCTGCG-3’; Capa, 5/-ATGAAATCT 
ATGTTGGTC-3’, 5'-CCAACGCGCGGGAAGGC-3'; actin, 5’-GCGTCGGTCAA 
TTCAATCTT-3’, 5’-AAGCTGCAACCTCTTCGTCA-3". All S2 cell experiments 
were independently repeated at least three times. 
NT enzyme immunoassay. Fifty microlitres of media from S2 cells transfected 
with ub-Gal4 and UAST-NT were used to measure NT levels using NT EIA Kit 
from Phoenix Pharmaceuticals (Burlingame, CA) as described previously**”. 
LC-MS/MS analysis of NT. NT was analysed by liquid chromatography tandem 
mass spectrometry (LC-MS/MS) using an LTQ-Orbitrap mass spectrometer 
(Thermo Fisher Scientific, Waltham, MA) coupled with an Eksigent Nanoflex 
cHiPLC system (Eksigent, Dublin, CA) through a nano-electrospray ionization 
source’. NT in the NT EIA kit (Phoenix Pharmaceutical) was used as standard. 
S2 cells were transfected with ub-Gal4 and UAST-NT or the UAST vector control, 
and conditioned medium was harvested after 48 h for analysis. GI tracts (350 per 
genotype) were dissected from wild-type (w1118) or Gr36C-NT 3rd instar larvae, 
methanol extracted”’, extracts were dried, and dissolved in 20 jl 0.1% (v/v) formic 
acid in water for analysis. Samples were separated by reversed-phase cHiPLC 
(ChromXP C18 column, 75 1m I.D. x 15cm length, Eksigent catalogue no. 
804-00001; mobile phase A and B were 0.1% (v/v) formic acid in either water or 
acetonitrile, respectively; flow rate, 300nl min~'). LC-MS/MS data were acquired 
in an automated data-dependent acquisition mode consisting of an Orbitrap MS 
scan (300—1,800 m/z, 60,000 resolutions) followed by MS/MS for fragmentation 
of the seven most abundant ions with the collision-induced dissociation method. 
MS/MS fragments corresponding to NT!’ were identified by comparison with 
the NT standard, and quantified using the intensity of the NT°* peak and NT 
standards (0.05, 0.1, 0.2, 0.5 fmol)””. 
Human population studies. The Malm6 Diet and Cancer (MDC) study is a 
population-based, prospective epidemiological cohort of 28,449 men (born 1923- 
1945) and women (born 1923-1950) from the city of Malm6 in southern Sweden 
who underwent baseline examinations between 1991 and 1996°°. From this cohort, 
6,103 persons were randomly selected to participate in the MDC Cardiovascular 
Cohort (MDC-CC), which was designed to investigate the epidemiology of carotid 
artery disease, between 1991 and 1994°’. Fasted plasma samples at the baseline 
examination were available for analysis of pro-NT and successfully measured in a 
total of 4,632 participants in the MDC-CC. Of those, complete data were available 
for BMI in 4,626, for waist circumference on 4,625, and for estimated degree of 
insulin resistance using the homeostasis model assessment of insulin resistance 
(HOMA-IR) (fasting blood glucose concentration x fasting plasma insulin 
concentration/22.5)© in 4,468 participants. BMI was defined as body weight in 
kilograms divided by the square of height in meters and obesity as a BMI > 30kgm ~”. 
Abdominal obesity was defined as a waist circumference of >94cm in males 
and >80cm in females, according to the International Diabetes Federation 
definition’. Insulin resistance was regarded present in subjects belonging to the top 
25% of HOMA-IR values in the MDC-CC-. ‘New-onset obesity’ is defined as obesity 
development among non-obese MDC-CC participants who were re-examined 
and diagnosed with obesity after an average follow-up time of 16.5 + 1.5 years. 
Pro-NT was measured in stored fasting plasma specimens (all samples were assayed 
in 2010) that were immediately frozen to —80°C at the MDC-CC baseline exam- 
ination using a chemiluminometric sandwich immunoassay to detect a pro-NT 
precursor fragment (pro-NT 1-117) as described previously®’. Analyses of blood 
glucose and plasma insulin were carried out at the time of baseline examination 
at the Department of Clinical Chemistry, Malm6 University Hospital, which is 
attached to a national standardization and quality control system®’. Of the 4,626 
subjects with baseline data on BMI and pro-NT, 2,900 subjects were re-examined 
with a new measurement of BMI after a mean follow-up of 16.5 + 1.5 years. In 
analyses of incident obesity, we excluded 306 subjects who were obese already at 
the baseline examination, leaving a total of 2,594 non-obese subjects for analysis 
of pro-NT in relation to incident obesity. All participants gave written informed 
consent, and the study was approved by the Ethical Committee at Lund University, 
Lund, Sweden. 
Statistical analysis. Descriptive statistics including means and standard deviations 
were calculated and represented using bar or line graphs. Linear mixed models 
were implemented to compare body weight and food intake levels over time as well 
as comparisons at each time point for other endpoints from in vivo experiments 
involving repeat measurements. Specifically, body weight growth curves were 
compared using linear mixed models with fixed effects for linear and quadratic 
terms for time and their interaction with genotype and random effects for the inter- 
cept and time factors. Contrast statements for the interaction terms in the model 
were used to assess differences in growth curves between genotypes. Likewise, 
linear mixed models were employed for body weight growth curve comparisons 
between vehicle versus SR 48692 with the same fixed and random effects terms 
as earlier along with baseline weight as a covariate. Total food consumption for 
22 weeks was calculated and compared between genotypes and between vehicle 


versus SR 48692 treatment using two-sample t-tests. Furthermore, weekly food 
intake levels were analysed using linear mixed models. However, since there is no 
clear trend in weekly food intake over time, an overall comparison of trend was 
not employed. Instead, a linear mixed model with fixed effects for genotype, time 
and their interaction and a random effect for the intercept term was employed in 
order to perform individual comparisons at each time point. A P-value adjustment 
due to multiple testing at each time point was performed using the Holm’s step- 
down procedure. We used the Akaike information criterion (AIC) to evaluate the 
goodness-of-fit of the above linear mixed models. Two-sided, two-sample Student's 
t-test was employed for cell culture and in vivo studies involving two independent 
groups. One-way or two-way analysis of variance models with test for interac- 
tion between two factors was employed for multiple group comparisons of diet 
(LFD, HFD), mouse genotype, feeding and fasting groups, different time points 
of measurement, varying doses of NT treatment, SR48692 and AICAR treatments 
with Holm’s P-value adjustment for multiple pairwise comparisons. ANCOVA 
was employed to compare genotype and diet with adjustment for the confounding 
effect of mouse body weight for fat composition and energy metabolism endpoints 
(energy expenditure, locomotor activity, energy intake, respiratory ratio). Model 
building was performed to assess equality of slopes using two-way interactions 
between each of genotype and diet with the body weight covariate as well as gen- 
otype and diet interactions. 

For the animal studies, sufficient sample sizes were used to provide at least 80% 
power to detect large effect sizes (1.0 to 3.0 mean differences in s.d. units) based 
on two-sided, two-sample t-tests with 5% significance level. Some experiments 
including body weight and food intake were measured repeatedly, thus affording 
larger statistical power. No replicate samples from in vitro studies were excluded 
in the analysis. All data from animal studies with measurement of study endpoints 
were included in the analysis. Experiments with slight differences in animal num- 
bers per group were due to the number of animals that were successfully bred and 
not due to exclusion of certain animals and their data points. Mice within a cage 
were randomized to all groups in an experiment to ensure balance in treatment 
group assignments across all cages. The animals were randomly selected for group 
assignment without preference to size or other confounding factors. A different 
individual performed measurements on study endpoints to ensure blinding from 
group assignment. Furthermore, only animal IDs without information on group 
assignment were available to staff performing the endpoint evaluation. Parametric 
tests were used after evaluating distribution of data (for example, percentiles, mean 
and median levels), test for normality (for example, Kolmogorov—Smirnov test, if 
sufficient sample sizes) and test for homogeneity of variance assumptions across 
groups. Otherwise, data log transformation or nonparametric tests were used. 
Appropriateness of other statistical models including linear mixed models was 
evaluated for goodness-of-fit using the AIC and equality of slopes between groups 
was evaluated for the ANCOVA models. 

All subjects at the MDC-CC baseline examination were divided into ascending 
quartiles according to their value of fasting pro-NT. BMI, waist circumference 
and HOMA-IR were analysed as dichotomous outcome variables as defined 
earlier. In cross-sectional analyses, we related baseline quartile of pro-NT to the 
dichotomous outcome of obesity, abdominal obesity, and insulin resistance using 
age- and sex-adjusted logistic regression models. In the analyses of incident obesity, 
we related baseline quartile of pro-NT to the dichotomous outcome of incident 
obesity using logistic regression adjusted for baseline age, sex and BMI. Data are 
presented as odds ratios (95% confidence intervals), and subjects belonging to the 
lowest quartile of pro-NT were defined as the referent group (odds ratio= 1). “P for 
trend’ denotes the P value for linear trend over quartiles 1-4. For code availability, 
statistical analysis codes in SAS version 9.4 were used for analysis of in vivo and 
in vitro studies and SPSS version 22.0 for analysis of human data and can be 
accessed by contacting B.M.E. 
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Extended Data Figure 1 | NT deficiency does not affect body length or Proximal intestinal samples were H&E stained (d; scale bar, 501m); 
small intestine morphology. a, Body length did not differ significantly villus height (e; n = 6) and crypt numbers (f; n = 6) have no significant 
between genotypes for either male (N¢t/* n= 13, Nt’ n= 12) or female differences between genotypes. Mice (7 months old) for all experiments 
(Ntt/* n=12, Nt’ n=12) mice. b-f, The average weight (b) and length were maintained on standard chow. All data are mean + s.d. Two-sided 
(c) of the small intestine was similar between the genotypes (n = 7). Student’s t-test for all figures. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | NT deficiency inhibits adiposity, 
inflammation and improves insulin/Akt signalling. a—c, Epididymal fat 
pad before (a; arrows) and after (b) dissection, and retroperitoneal fat pad 
(c; arrows) from male mice maintained on a normal chow (n=5). 

d-f, Representative epididymal (d; arrows), retroperitoneal (e) and 
pericardial fat pads (f) of male mice fed a HFD for 24 weeks (n=5). 

g, Weekly body weight (male N¢t/* and Nt" n= 13) and (female Ntt/* 
n= 12, Nt’ n= 14) of mice ona LED for 22 weeks. *P < 0.05 Nt*’* versus 
Nt‘~ in male mice. h, Fat composition in male mice fed a LFD or HED for 
24 weeks (n =5). *P < 0.05 versus LFD fat; +P < 0.05 versus Nt*’* LED fat; 
+P < 0.05 versus N¢*/+ HFD fat; §P < 0.05 versus N¢*/* HED lean. i, Ntt/* 
and Nt! mice fed a LFD or HFD for 22 weeks after weaning were fasted 
overnight. Saline or insulin (5 units) was injected into the inferior vena 
cava (IVC). Five minutes after the injection, liver tissues were collected 
and protein extracts analysed by western blot. A representative result is 
shown from three separate experiments. j, Male Nt‘/* and Nt~'~ mice 


(7 months old) maintained on standard chow were fasted overnight 
followed by glucose (2 mg per kg body weight) administration by gavage 
(n=5). Blood samples were collected both before and at the indicated 
times after glucose administration for measurement of plasma insulin 
(left) and glucose (right) levels. No statistically significant differences in 
either insulin or glucose levels were apparent between genotypes. k, Male 
N¢t*’+ and Nt‘~ mice (12 months old) fed a standard diet after weaning 
were fasted for 16h or fasted for 16h and refed for 4h (n =3). Blood was 
collected from the IVC and plasma was used to measure insulin. *P < 0.05 
versus fasted Nt*’* mice. 1, H&E staining of epididymal fat pad from male 
mice shown in Fig. 1g demonstrating inflammatory cells (arrows) (scale 
bar, 100 1m). m, F4/80-positive macrophages (arrows) in epididymal 

fat pad from HFD-fed mice (scale bar, 100 um) (n=5). All data are 

mean +s.d. Linear mixed model for g; analysis of covariance (ANCOVA) 
with Holm’s P-value adjustment for h; ANOVA with Holm’s P-value 
adjustment for j, k. (See Supplementary Fig. 1 for gel source data.) 
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Extended Data Figure 3 | Food intake and indirect calorimetry 
measurements. a—c, Food intake was measured weekly for Nt*/* and 
Nt" male (LED n =3 and HED n =4) and female (LFD n = 3 and HED 
n=4) mice. Analysis of cumulative food intake for 22 weeks did not 
show a significant difference except in female mice fed a HFD. *P < 0.05 
versus female Ntt’+ fed a HED (a). For weekly food intake, there was no 
significant difference between genotypes in male mice on both diets (b) 
and female mice fed a LFD (c, left); food intake on week 9 in female mice 
fed a HFD reached significance between genotypes. *P < 0.05 versus Nt! 
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(c, right). d-g, Male mice on LFD and HED diets for 24 weeks were placed 
in individual cages and indirect calorimetric analysis was performed 

(n= 10). Energy expenditure is presented by the average kcal h~' in 24h 
and in resting period (d). Locomotor activity represents counts of beam 
breaks in a 30-min period (e). Energy intake represents the food intake in 
kcal for 24h (f). Resting RER is presented in g. *P < 0.05 versus LFD in 
Nett!* mice; +P < 0.05 versus LFD in Nf" mice. All data are mean +s.d. 
Two-sided Student's t-test per diet and sex groups for a; linear mixed 
model for b, c; ANCOVA for d-g. 
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Extended Data Figure 4 | NT promotes intestinal cell lipid absorption 
or accumulation. a, Faecal weight of male Nt*/* and Nt/~ mice fed either 
normal chow (NC; N¢*/*+ n=4, Nt’ n=5) ora HED (Ntt'* n=7, 

Nt n=9) for 24 weeks; there was no significant difference between 
genotypes. b, Male N¢*/* and Nt/~ mice (1 = 4) on normal chow were fasted 
overnight. Mice were injected with either saline or SR 48692 (2.5 mg per kg 
body weight, i.p). Thirty minutes after the injection, mice were given 
olive oil (1011 per g body weight) by oral gavage and then killed. The 
proximal small intestine was excised and intestinal TG content measured. 
*P <0.05 versus saline in Nt*/* and Nt mice; +P < 0.05 versus olive oil 
only in Nt*/* mice; +P < 0.05 versus olive oil only in Nt*/* mice. c, RIE-1 
cells were pre-treated with BSA or NT (24M) for 30 min, followed by 
incubation with BSA-conjugated BODIPY FL Cj¢ for 15 min, and labelled 
lipids were visualized by confocal microscopy (original magnification, 

x 180). Representative images at x60, zoom 3 are from three experiments. 
d, RIE-1 cells were pre-treated with or without NT at different 
concentrations for 30 min followed by the addition of BSA-conjugated 
oleate (0.1 mM) and further incubation overnight. Cells were collected, 
lysed and TG was measured (n = 3). *P < 0.05 versus BSA only; tP < 0.05 
versus oleate only. e, Total RNA was isolated from human (FHs 74 Int) 
and rat (RIE-1) small intestinal epithelial cells and reverse transcription 
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polymerase chain reaction (RT-PCR) was performed using specific 
primers targeting human or rat NTR1, 2 and 3. f, g, Expression of NTR1 
(f) and NTR3 (g) was analysed by western blot from mucosa scraped from 
mouse proximal (pro), middle (mid), distal (dis) small intestine and colon 
(lanes 3-6) as well as human duodenum (duo) and colon (lanes 7-8). 
Proteins from HepG2 (human hepatocellular carcinoma cell line) (f; lanes 
1-2) and Caco-2 (human colon cancer cell line) (g; lanes 1-2) cells stably 
expressing NTR1, NTR3, or control (NTC) shRNA were used as positive 
and negative controls. h, i, RIE-1 cells transfected with non-targeting 
control (NTC) siRNA or Ntr1 or Ntr3 siRNA for 72h were treated with 

or without NT (2 1M) for 30 min followed by BSA-conjugated BODIPY 
FL Cy¢ (Ci6) for 15 min, and imaged by confocal microscopy (original 
magnification, x 180) to quantify fluorescence (h) and intensity (i) as 
described in Methods (n = 30 cells). *P < 0.05 versus Cyg in NTC siRNA; 
+P <0.05 versus Cy¢ plus NT in NTC siRNA. j, Cumulative (left) and 
weekly (right) food intake was measured in male wild-type C57BL/6 mice 
fed a HFD and chronically treated with either SR 48692 (n = 13) or vehicle 
(n= 12). Neither analysis demonstrated a significant difference. 

All data are mean + s.d. ANOVA with Holm’s P-value adjustment for 

a, b, d, i; two-sided Student's t-test for j (left); linear mixed model for 

j (right). (See Supplementary Fig. 1 for gel source data.) 
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Extended Data Figure 5 | NT negatively regulates AMPK activity. 

a, Western blotting and densitometry of p-AMPK in proximal intestinal 
mucosa of male mice (12 months old) maintained on standard chow 

and fasted for 24h (n=9). *P <0.05 versus N¢*’. b, Western blotting 
and densitometry of p-AMPK levels using samples described in Fig. 2e 
(n=8). *P <0.05 versus saline; +P < 0.05 versus olive oil alone. c, FHs 
74 Int (top) and RIE-1 (bottom) cells were treated with the indicated 
concentrations of oleate for 1h and lysates were analysed by western 
blotting. d, RIE-1 cells were pre-treated with or without NT for 30 min 
followed by combined treatment with BSA or oleate (0.1 mM) for 1h and 
western blot analysis. e, FHs 74 Int cells transfected with either human 
NTRI1 or NTC siRNA (100 nM)), or either human NTR3 or NTC siRNA 
(20 nM) as indicated for 72 h were pre-treated with or without NT (2 uM) 
for 30 min followed by treatment with oleate (0.1 mM) or BSA for Lh 
and western blot analysis. f, RIE-1 cells were transfected with rat siRNAs 
and treated with NT and oleate as described in e. g, FHs 74 Int cells 


transfected with LKB1, CAMKK2 or control (all 40nM) siRNAs for 3 days 
were pre-treated with NT (21M) for 30 min followed by oleate (0.1 mM) 
for 1h and cell extracts were analysed by western blotting. h, FHs 74 Int 
cells transfected with Flag-CAMKK2 and control vector for 48 h were 
pre-treated with or without NT (2 1M) followed by oleate (0.1 mM) or 
BSA for 1h and analysed by western blotting (left); p- AMPK levels were 
determined as in b from three separate experiments (right). *P < 0.05 
versus BSA in vector- and CAMKK2-transfected cells, respectively; 

+P < 0.05 versus BSA in vector-transfected cells; +P < 0.05 versus oleate 
in vector-transfected cells; §P < 0.05 versus BSA in vector-transfected 
cells; ||P < 0.05 versus NT alone in vector-transfected cells; ¢P < 0.05 
versus oleate plus NT in vector-transfected cells. All data are mean + s.d. 
Two-sided Student’s t-test for a (bottom); ANOVA with Holm’s P-value 
adjustment for b (bottom), h (right) (See Supplementary Fig. 1 for gel 
source data.) 
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Extended Data Figure 6 | NT regulates lipid droplet accumulation 
and AMPK activation in Drosophila midgut, fat body, and oenocytes. 
a, Midgut from Gr36C-NT adult (7 days old) was stained for NT and 
Prospero (Pros). Insets in a and b are high-magnification images of 
individual Pros-expressing enteroendocrine cells, co-expressing either 
NT or nuclear green fluorescent protein (GFP) (see later), respectively 
(n= 3). Scale bar, 50 um. b, Similar experiment using Gr36C-Gal4 to 
drive nuclear GFP expression, demonstrating co-localization of GFP and 
Pros in nuclei of enteroendocrine cells in adult flies (7 days old) (n=3). 
Scale bar, 50 um. c, NT expression promotes the accumulation of lipid 
droplets. Midguts from either control Gr36C-Gal4 (left; 100%, n = 15) or 
Gr36C-NT (right; 100%, n = 19) 3rd instar larvae stained with BODIPY. 
Similar results were obtained with Nile Red staining (data not shown). 
Scale bar, 100j1m. d, Western blot analysis of p> AMPK and AMPK levels 
in gastrointestinal tract of Gr36C-Gal4 control and Gr36C-NT 3rd instar 
larvae. e, Conditional expression of AMPK (middle; Myo-AMPK, 29°C, 
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86%, n= 14) or AMPK RNAi (right; Myo-AMPKRNAi**”?!, 29°C, 91%, 
n= 11) leads to either suppression or enhancement of lipid accumulation 
(visualized with BODIPY) as compared to control (left; Myo*-AMPK, 
20°C, 100%, n = 9) 3rd instar larvae. Embryos were raised at 20°C and 

96h after egg laying were switched to the non-permissive temperature 

(29 °C) to induce Gal4 expression. AMPK overexpression and RNAi 
inhibition were monitored by western blotting with AMPK antibody (data 
not shown). Scale bar, 100m. f, Similar results were obtained in 3rd 
instar larval fat bodies using a RU486- inducible $106-Gal4 driver to drive 
AMPK without RU486 (left; control, 100%, n = 9), AMPK with RU486 
(middle; 89%, n = 9) or AMPK RNAi with RU486 (right; 100%, n = 10) 
expression. Scale bar, 50m. g, Lipid accumulation in oenocytes (HNF- 
positive) visualized with BODIPY as in e. Genotypes and fluorescent stains 
are indicated. Compare middle panels (100%, n = 6) and bottom panels 
(89%, n=9) to top control panels (100%, n= 8). Scale bar, 501m. 

(See Supplementary Fig. 1 for gel source data.) 
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Extended Data Figure 7 | LC-MS/MS analysis of the processed NT triply charged NT°* peptide. The m/z values of major fragment ions are 
peptide. a, The mass spectrum of the triply charged NT (NT**) peptide designated, confirming the peptide as biologically active 13-amino-acid NT. 
eluted at 18.96 min in conditioned medium of 82 cells expressing full- c, The amount of NT in S2 medium and fly gastrointestinal tracts 
length human NT precursor. The labelled three peaks are the isotopic from adult Gr36C-NT flies (350 guts collected for each genotype) was 
envelope of NT?* with a mass accuracy less than 2 p.p.m. from the quantified. N.D., not detected. 


theoretical m/z value of 558.31050. b, The tandem mass spectrum of the 
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Extended Data Figure 8 | CG9918 (mouse NTRI analogue) RNAi blocks 
the NT-mediated decrease of p-AMPK levels in Drosophila S2 cells. 

a, S2 cells were transfected with ub-Gal4 plus UAST-NT (ub-Gal4-NT) or 
control (ub-Gal4) vector and treated with the indicated double-stranded 
RNAs (dsRNAs) to knockdown individual receptors. Cell lysates were 
subjected to western blot with the indicated antibodies to examine 

the activation of AMPK (left). The efficiency of RNAi knockdown 

was monitored by real-time PCR (middle; n = 3); *P< 0.05 versus 
control dsRNA. Medium from cells expressing either ub-Gal4 alone or 
ub-Gal4-NT was collected to examine NT levels by enzyme immunoassay 
(EIA)*° (right; n =6); *P <0.05 versus ub-Gal4. b, S2 cells were treated 
with the indicated concentrations of NT peptide for 1h and AMPK 
activation was monitored as in a. NT treatment (0.2, 0.511M) decreased 
p-AMPK to levels similar to those observed in ub-Gal4-NT-transfected 
cells where NT levels reach approximately the same concentration 

(~350 pgml7’, 0.2\.M). c, Midguts of 3rd instar larvae from the indicated 
genotypes were stained with BODIPY to monitor the accumulation of 
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lipid droplets. Left, w1118 control midgut accumulates low level of lipid 
(100%, n= 8). Middle, larval midgut constitutively expressing NT by the 
TK promoter (TK-NT) accumulates high levels of lipid (100%, n= 8). 
Right, larval midgut co-expressing TK-NT and Myo1A-CG9918RNAi?’*°? 
accumulated much lower levels of lipid (87%, n= 15) compared to TK-NT 
midgut (middle). Scale bar, 100 um. d, S2 cells were transfected with 
ub-Gal4-NT or control vector, treated with the indicated dsRNAs, and cell 
lysates were analysed as in a. Inactivation of Capability (Capa; CG15520), 
the Pyrokinin-1 in Drosophila, does not alter the levels of p-AMPK in 
either the presence or absence of NT (top), indicating that NT prevents 
AMPK activation independently of Pyrokinin-1. Capa RNAi efficiency 
was monitored by real time-PCR (bottom; n = 3). *P < 0.05 versus 

control dsRNA. e, Amino acid sequence alignment of mouse NTR1 and 
Drosophila CG9918, identities and conserved residues (+) are indicated. 
All data are mean + s.d. ANOVA for a (middle), d (bottom); two-sided 
Wilcoxon rank-sum test for a (right). (See Supplementary Fig. 1 for gel 
source data.) 
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Extended Data Table 1 | Clinical characteristics of the MDC-CC 


Data are given as mean +s.d. for normally distributed variables, and as median and interquartile range for fasting insulin concentration. Categorical data are presented as numbers (percentages). 
N denotes the number with complete data, thus, included in analyses. 
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Extended Data Table 2 | Continuous values of pro-NT and association with continuous values of clinical outcomes 


p-coefficient + SE 
Metabolic trait (per SD increment of log-transformed pro-NT) p-value 


Body Mass Index (kg/m) 0.14 + 0.06 0.016 


Waist circumference (cm) 0.33 + 0.15 0.026 
HOMA-IR (mmol/L*mU/L) 0.13 + 0.015 <0.001 


Based on linear regression model of log-transformed fasting plasma concentration of pro-NT expressed as per 1 s.d. increment in log-transformed value. Standard error (s.e.) of beta coefficient. 
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Maintenance and propagation of a deleterious 
mitochondrial genome by the mitochondrial 


unfolded protein response 


Yi-Fan Lin!, Anna M. Schulz!, Mark W. Pellegrino!, Yun Lu’, Shai Shaham? & Cole M. Haynes! * 


Mitochondrial genomes (mitochondrial DNA, mtDNA) encode 
essential oxidative phosphorylation (OXPHOS) components. 
Because hundreds of mtDNAs exist per cell, a deletion in a single 
mtDNA has little impact. However, if the deletion genome is 
enriched, OXPHOS declines, resulting in cellular dysfunction. 
For example, Kearns-Sayre syndrome is caused by a single 
heteroplasmic mtDNA deletion. More broadly, mtDNA deletion 
accumulation has been observed in individual muscle cells! and 
dopaminergic neurons” during ageing. It is unclear how mtDNA 
deletions are tolerated or how they are propagated in somatic cells. 
One mechanism by which cells respond to OXPHOS dysfunction 
is by activating the mitochondrial unfolded protein response 
(UPR™), a transcriptional response mediated by the transcription 
factor ATFS-1 that promotes the recovery and regeneration of 
defective mitochondria**. Here we investigate the role of ATFS-1 
in the maintenance and propagation of a deleterious mtDNA ina 
heteroplasmic Caenorhabditis elegans strain that stably expresses 
wild-type mtDNA and mtDNA with a 3.1-kilobase deletion 
(AmtDNA) lacking four essential genes”. The heteroplasmic 
strain, which has 60% AmtDNA, displays modest mitochondrial 
dysfunction and constitutive UPR™ activation. ATFS-1 impairment 
reduced the AmtDNA nearly tenfold, decreasing the total 
percentage to 7%. We propose that in the context of mtDNA 
heteroplasmy, UPR™ activation caused by OXPHOS defects 
propagates or maintains the deleterious mtDNA in an attempt to 
recover OXPHOS activity by promoting mitochondrial biogenesis 
and dynamics. 

mtDNA mutations and deletions are relatively common within 
the mtDNA population of ageing post-mitotic cells but typically 
represent a minor percentage of the total mtDNA®’. However, if 
the deleterious mtDNA reaches a high enough percentage rela- 
tive to wild-type mtDNA, it can, depending on the severity of the 
mtDNA lesion, become toxic and lead to disease®. As mtDNA encodes 
13 (12 in C. elegans) essential components of the respiratory chain 
and ATP synthase, along with rRNAs and tRNAs required for their 
synthesis, an increase in deleterious heteroplasmy can perturb multiple 
cellular activities reliant on OXPHOS. However, the mechanism(s) 
that promote tolerance to large mtDNA deletions or contribute to their 
propagation are unknown. 

Here we examine the role of the UPR™ on the maintenance and 
propagation of a mtDNA with a 3.1 kb deletion (AmtDNA) encod- 
ing four essential OXPHOS genes (Fig. la) using a heteroplasmic 
C. elegans strain’. The UPR™ is regulated by the transcription factor 
ATFS-1, which is normally efficiently imported into mitochondria and 
degraded. However, during mitochondrial stress or OXPHOS dysfunc- 
tion, a percentage of ATFS-1 fails to be imported and traffics to the 
nucleus to activate a broad transcriptional program (over 500 tran- 
scripts) that promotes repair and recovery of mitochondrial function**. 


To examine the effect of AmtDNA on UPR™ activation, AmtDNA 
was crossed into a transcriptional reporter worm harbouring the 
hsp-6p::gfp transgene used to monitor UPR™ activation’. Consistent 
with previous studies demonstrating that perturbation of nuclear- 
encoded OXPHOS components activated the UPR™ (Extended Data 
Fig. 1a)*'°, AmtDNA also modestly activated the UPR™, which 
required atfs-1 (Fig. 1b). Consistent with a previous report’, AmtDNA 
made up 60% of all mtDNAs (Fig. 1c). And, AmtDNA caused a signifi- 
cant reduction in basal oxygen consumption as well as total respiratory 
capacity (Fig. 1d), suggesting 60% AmtDNA perturbs OXPHOS and 
activates the UPR™. 

Surprisingly, the development of worms harbouring AmtDNA was 
unaffected by RNAi-mediated knockdown of atfs-1 (atfs-1(RNAi)), 
in stark contrast to worms with hypomorphic mutations in nuclear- 
encoded OXPHOS components (Fig. le)*"’, indicating the UPR™ is 
not required for development when the OXPHOS defect derives from 
a mtDNA deletion. Thus, we examined the effect of UPR™ inhibi- 
tion on AmtDNA levels by impairing several components required 
for hsp-6p::gfp induction*’*. Notably, atfs-1 deletion or knockdown 
caused a dramatic reduction in AmtDNA levels, shifting the percent- 
age from 60% to 7% (Fig. 1c and Extended Data Fig. 1b-d), probably 
explaining the normal growth rate (Fig. le). AmtDNA quantification 
in individual wild-type or atfs-1-deletion worms was consistent with 
that observed in larger worm populations and in some cases AmtDNA 
was reduced below the limit of detection in worms lacking ATFS-1 
(Extended Data Fig. le). Therefore, ATFS-1 and UPR™ activation are 
required to maintain the deleterious mtDNA. 

A mechanism by which deleterious mtDNA levels can be altered 
involves the germ-line bottleneck where only a small number of 
mtDNAs are passed maternally to the next generation, allowing shifts 
in heteroplasmy’. Because the atfs-1-deletion strain was generated 
via mating, it is unclear if the shift in AmtDNA occurred during 
germ-line transmission, somatic cell division and growth, or both. 
To examine the role of ATFS-1 in AmtDNA maintenance specifically 
in somatic cells, we employed glp-4(bn2) worms that lack germlines 
when raised at the restrictive temperature. Interestingly, exposure of 
worms to atfs-1 RNAi from the L1 stage to adulthood also depleted 
AmtDNA relative to the identical worm population raised on control 
RNAi (Fig. 1f), consistent with a role for the UPR™ in maintaining 
AmtDNA levels in post-mitotic somatic cells. 

Because mitochondrial autophagy (mitophagy) could potentially 
eliminate AmtDNAs when atfs-1 is inhibited, we examined the inter- 
action between AmtDNA, atfs-1 and known mitophagy components. 
Mitophagy involves the recognition of defective mitochondria by the 
kinase PINK-1. Once PINK-1 accumulates, it recruits the ubiquitin 
ligase Parkin (PDR-1 in C. elegans) to the mitochondrial outer mem- 
brane, which directs the damaged organelle to lysosomes for degra- 
dation’. As described previously, pdr-1 deletion resulted in increased 


‘Cell Biology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA. *Laboratory of Developmental Genetics, The Rockefeller University, New York, New York 10065, 
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Figure 1 | ATFS-1 is required to maintain a deleterious mtDNA. 

a, Comparison of wild-type and uaDf5 deletion (AmtDNA) mtDNAs. 

b, hsp-6p,::gfp in wild-type or AmtDNA worms on control or atfs-1 
RNAi. Scale bar, 0.1mm. c, AmtDNA quantification as determined by 
qPCR in AmtDNA or atfs-1(tm4919); AmtDNA worms. n = 3; error bars, 
mean + s.d.; *P < 0.03 (Student's t-test). d, Oxygen consumption rates 
(OCR) in wild-type and AmtDNA worms. n=5 and n= 10, respectively; 
error bars, mean +s.d.; *P < 0.03 (Student’s t-test). e, AmtDNA or 
isp-1(qm150) worms raised on control or atfs-1 RNAi. Scale bar, 1 mm. 

f, AmtDNA quantification as determined by qPCR in glp-4(bn2); AmtDNA 
worms on control or atfs-1 RNAi. n= 3; error bars, mean + s.d.; *P < 0.03 
(Student’s t-test). 


AmtDNA (Fig. 2a)"4, consistent with mitophagy targeting defec- 
tive mitochondria containing relatively high levels of deleterious 
mtDNAs'°. However, the reduction of AmtDNA caused by atfs-1 
inhibition was only partially blocked by pink-1;pdr-1 deletion or 
atg-18 deletion, which impairs general autophagy (Fig. 2a and 
Extended Data Fig. 2a). Additionally, atfs-1 inhibition did not impair 
development of pink-1;pdr-1-deficient AmtDNA worms (Fig. 2b) 
despite the increased UPR™ activation (Fig. 2c). As PDR-1 inhibition 
did not completely restore AmtDNA in atfs-1-deletion worms, ATFS-1 
probably promotes AmtDNA propagation independent of mitophagy 
or other Parkin-mediated activities'®. 

As modest UPR™ activation was required to maintain the dele- 
tion genome (Fig. 1), we examined the effects of stronger UPR™ acti- 
vation on AmtDNA expansion or propagation. To further activate 
the UPR™ in somatic cells, the mitochondrial protease SPG-7 was 
impaired by RNAi** during development in worms lacking germlines. 
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Despite strong UPR™ activation, AmtDNA levels did not increase 
(Fig. 2d and Extended Data Fig. 2b). However, because PINK-1 and 
Parkin are also activated by mitochondrial unfolded protein stress!’ 
we wondered whether PINK-1 and PDR-1 activities limit AmtDNA 
accumulation. Remarkably, AmtDNA levels increased threefold in 
pink-1;pdr-1-deletion worms raised on spg-7 RNAi (Fig. 2d) suggesting 
that UPR™ activation can promote propagation of deleterious 
mtDNAs, which is antagonized by mitophagy during strong mitochon- 
drial stress. However, because wild-type mtDNAs were also increased, 
the AmtDNA percentage was unaffected (Fig. 2d). As both wild-type 
and AmtDNA were increased by spg-7(RNAi), ATFS-1 may promote 
a mitochondrial biogenesis program in response to mitochondrial 
dysfunction. Notably, mitochondrial mass!® also increased in pink-1; 
par-1-deficient worms treated with spg-7 RNAi (Fig. 2e) consistent 
with ATFS-1 mediating a compensatory mitochondrial biogenesis 
program that maintains AmtDNA. Combined, these results suggest 
that during strong mitochondrial stress, balanced PINK-1/PDR-1 and 
ATFS-1 activity limits AmtDNA accumulation. 

To determine if ATFS-1 activation is sufficient to increase mito- 
chondrial biogenesis and AmtDNAs independent of mitophagy, we 
used a mutant strain with constitutive UPR™ activation owing to an 
amino acid substitution within the ATFS-1 mitochondrial targeting 
sequence”. Consistent with the UPR™ promoting a mitochondrial 
biogenesis program, atfs-1(et18) animals also displayed a marked 
increase in the number of mitochondria (Fig. 3a). atfs-1(et18) worms 
also had increased AmtDNA and wild-type mtDNAs (Fig. 3b), 
however AmtDNA was further increased than wild-type mtDNAs, 
resulting in an increase of AmtDNA from 63% to 73%. Consistent 
with increased AmtDNAs, atfs-1(et18); AmtDNA worms developed 
slower than atfs-1(et18) worms (Fig. 3c) or AmtDNA worms”. And 
consistent with further OXPHOS impairment, atfs-1(et18); AmtDNA 
worms consumed less oxygen (Fig. 3d), had reduced mitochondrial 
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Figure 2 | ATFS-1 promotes AmtDNA maintenance and mitochondrial 
biogenesis largely independent of Parkin. a, AmtDNA quantification 

as determined by qPCR in AmtDNA and pdr-1(tm598);pink-1(tm1779); 
AmtDNA worms raised on control or atfs-1 RNAi. n = 3; error bars, 
mean +s.d.; *P < 0.03 (Student's t-test). b, pdr-1(tm598);pink-1(tm1779); 
AmtDNA worms raised on control or atfs-1 RNAi. Scale bar, 1 mm. 

c, pdr-1(tm598);hsp-6p::gfp worms with wild-type or AmtDNA. Scale bar, 
0.1mm.d, AmtDNA quantification as determined by qPCR in 
glp-4(bn2); AmtDNA and glp-4(bn2);pdr-1(tm598);pink-1(tm1779); 
AmtDNA worms raised on control or spg-7 RNAi. n = 3; error bars, 

mean + s.d.; *P < 0.03 (Student's t-test). e, ges-Ipr::gfp'""; AmtDNA or 
ges-Ipr:g fp ;pdr-1 (tm598);pink-1(tm1779);AmtDNA worms raised on 
control or spg-7 RNAi. WT, wild type. Scale bar, 0.1 mm. 
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membrane potential (Extended Data Fig. 2c) and were sensitive to 
the OXPHOS inhibitor rotenone (Extended Data Fig. 2d). Combined, 
these results indicate that UPR™ activation is detrimental in the 
presence of AmtDNAs. 

To investigate potential causes of the OXPHOS deficiency and 
UPR™ activation in AmtDNA worms, we compared expression of 
a gene within the AmtDNA deletion (ND2), that is only expressed 
by the population of wild-type mtDNAs, to mtDNA-encoded genes 
located outside of the deletion that are encoded by both wild-type and 
AmtDNAs (ND4, ND6) (Fig. 1a). Surprisingly, ND2 transcripts were 
not reduced in AmtDNA or atfs-1(et18); AmtDNA animals despite 
the absence of the ND2 gene in AmtDNA (Fig. 3e). Interestingly, 
ND4 was expressed significantly more in both AmtDNA and atfs-1 
(et18); mtDNA relative to wild-type worms while the nuclear- 
encoded complex I transcript nuo-4 was unaffected (Fig. 3e). These 
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Figure 3 | ATFS-1 activation causes deleterious mtDNA expansion. 

a, ges-Ip)::g fp"; AmtDNA or ges-1p;::¢fp":atfs- 1(et18); AmtDNA worms 
raised on control or fzo-1 RNAi. The bottom panels are a single intestinal 
cell. WT, wild type. Scale bar, 0.1mm (top) and 1 1m (bottom). 

b, AmtDNA quantification as determined by qPCR in AmtDNA or 
atfs-1(et18);AmtDNA worms. WT, wild type. n = 3; error bars, 

mean + s.d.; *P < 0.03 (Student's t-test). c, Synchronized atfs-1(et18) 

and atfs-1(et18); mtDNA worms. Scale bar, 1 mm. d, Oxygen 
consumption rates (OCR) of wild-type, AmtDNA, atfs- 1(et18) 

and atfs-1(et18);AmtDNA worms. n= 10; error bars, mean +s.d.; 

*P < 0.03 (Student's t-test). e, ND2, ND4, ND6 and nuo-4 transcripts 

as determined by qRT-PCR in wild-type, AmtDNA, atfs-1(et18) and 
atfs-1(et18); AmtDNA worms. n= 3; error bars, mean + s.d.; *P < 0.03 
(Student’s t-test). RFU, relative fluorescent units. f, An intestinal cell 

in AmtDNA and atfs-1(et18); AmtDNA worms. Mitochondria (white) 
and autophagosomes (orange) are indicated. Scale bars, 500 nm (lower 
right, 300 nm). g,h, AmtDNA quantification as determined by qPCR in 
AmtDNA, atfs-1(et18); AmtDNA or pdr-1(tm598);atfs-1(et18); AmtDNA 
worms raised on control, fzo-1, drp-1, or polg-1 RNAi. n= 3; error bars, 
mean + s.d.; *P < 0.03 (Student’s t-test). 
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results suggest that mitochondrial dysfunction in AmtDNA worms 
is not due to reduction of transcripts encoded by genes within the 
mtDNA deletion. 

Next, we performed electron microscopy to examine mitochondrial 
morphology in atfs-1(et18) harbouring 73% AmtDNA. The cristae 
were largely absent in these mitochondria (Fig. 3f), consistent with 
severe mitochondrial dysfunction and reduced OXPHOS (Fig. 3d). 
Notably, a number of autophagosome-like structures associated with 
degenerate mitochondria were observed only in atfs-1(et18); AmtDNA 
worms (Fig. 3f). And, consistent with increased mitophagy in 
atfs-1(et18); AmtDNA animals, pdr-1 deletion further increased the 
percentage of AmtDNA (Fig. 3g). Combined, these results suggest 
that in the context of deleterious heteroplasmy, increased UPR™ acti- 
vation can further perturb mitochondrial function, potentially leading 
to increased mitophagy. 

To better understand the ATFS-1-mediated transcriptional program 
that promotes AmtDNA maintenance and propagation, the transcrip- 
tomes of wild-type and atfs-1(et18) worms were examined. Activated 
ATFS-1 induced many transcripts suggestive of mitochondrial bio- 
genesis, including the mitochondrial protein import machinery, the 
cardiolipin synthesis enzyme tafazzin, mitochondrial ribosome and 
translation factors, prohibitin complex components, mitochondrial 
chaperones and OXPHOS assembly factors (Supplementary Table 1), 
consistent with ATFS-1 regulating a mitochondrial biogenesis and 
mitochondrial proteostasis program to recover mitochondrial 
function. atfs-1(et18) also increased the mtDNA polymerase polg-1, 
the worm orthologue of the mtDNA-binding protein TFAM (hmg-5), 
as well as transcripts required for mitochondrial dynamics 
(Supplementary Table 1 and Extended Data Fig. 3a). 

Interestingly, inhibition of mitochondrial fusion or fission by 
fzo-1(RNAi) or drp-1(RNAi) reduced the AmtDNA percentage in a 
similar manner to atfs-1(RNAi) (Fig. 3h) in AmtDNA worms, sug- 
gesting that organelle dynamics stimulated by the UPR™ promote 
deleterious mtDNA maintenance. Because fzo-1(RNAi) inhibition 
had relatively little effect on mitochondrial biogenesis (Fig. 3a), only 
organelle morphology (Fig. 3a, lower panel), we speculate that orga- 
nelle mixing mediated by drp-1 and fzo-1, which requires OXPHOS 
function’, limits the enrichment of deleterious mtDNAs in individ- 
ual organelles and promotes tolerance to deleterious mtDNAs”!”?. 
Consistent with this idea, development of atfs-1(et18); AmtDNA ani- 
mals was delayed when mitochondrial fusion was inhibited (Extended 
Data Fig. 3b). Impairment of polg-1 and hmg-5 also reduced the 
AmtDNA percentage in AmtDNA worms (Fig. 3h and Extended Data 
Fig. 3c), suggesting that replication and mtDNA protection is involved 
in maintaining the deleterious genome. Combined, these data suggest 
that through multiple outputs, ATFS-1 activation provides favourable 
conditions for AmtDNA proliferation. 

These results suggest an unanticipated consequence of ATFS-1 and 
UPR™ activation in the context of deleterious mtDNA heteroplasmy. 
While the UPR™ is protective during exposure to mitochondrial 
toxins*’ or mutations within nuclear-encoded OXPHOS genes*”’, in 
the context of mtDNA heteroplasmy, UPR™ activation maintains the 
deleterious mtDNA. We propose that by inducing a mitochondrial 
recovery program, UPR™ activation inadvertently propagates deleteri- 
ous mtDNAs in an attempt to recover OXPHOS activity. These results 
potentially shed light on the underlying mechanisms that lead to mito- 
chondrial diseases and the enrichment of AmtDNAs found in aged 
cells”®. They also emphasize the importance of UPR™ regulation and 
suggest that prolonged UPR™ activation is potentially harmful’’, as 
ATFS-1 activation creates an environment favourable for AmtDNAs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Worm strains, staining and rotenone treatment. The atfs-1(tm4919), 
pink-1(tm1779) and pdr-1(tm598) strains were obtained from the National 
BioResource Project (Tokyo, Japan) and the N2 (wild-type mtDNA), LB138 
(uaDf5 or AmtDNA), atg-18(gk378), isp-1(qm150) and glp-4(bn2) strains from the 
Caenorhabditis Genetics Center (Minneapolis, Minnesota). The atfs-1(et18) strain 
was a gift from M. Pilon. The ges-1p,::gfp’" worms used for visualizing mitochon- 
drial content and the reporter strain hsp-6,,::gfp for visualizing UPR™ activation 
have been described””*. Hermaphrodites were used for all experiments. Rotenone 
treatment and TMRE staining was performed by synchronizing and raising worms 
on plates previously soaked with M9 buffer containing rotenone (Tocris), DMSO, 
or TMRE (Molecular Probes). TMRE-stained worms were subsequently placed 
on control plates to rmove TMRE-containing bacteria in the digestive tract for 
3h before imaging. 

mtDNA quantification. Wild-type mtDNA and AmtDNA quantification was 
performed using qPCR-based methods similar to previously described assays'*”°. 
20-30 worms were collected in 30 ul of lysis buffer (50 mM KCl, 10 mM Tris-HCl 
(pH 8.3), 2.5mM MgCl, 0.45% NP-40, 0.45% Tween 20, 0.01% gelatin, with 
freshly added 200 1g ml’ proteinase K) and frozen at —80 °C for 20 min before 
lysis at 65°C for 80 min. Relative quantification was used for determining the 
fold changes in mtDNA between samples. 1 jl of lysate was used in each triplicate 
qPCR reaction. qPCR was performed using the Thermo-Scientific SyBr Green 
Maxima Mix and the MyiQ2 Two-Colour Real-Time PCR Detection System 
(Bio-Rad Laboratories). Primers that specifically amplify wild-type or AmtDNA 
are listed in Supplementary Table 2. Primers that amplify a non-coding region 
near the nuclear-encoded ges-1 gene were used as a control (Supplementary 
Table 2). 

Absolute quantification was also performed to determine the percentage or 
ratio of AmtDNA relative to total mtDNA (AmtDNA and wild-type mtDNA). 
This approach was also employed to determine AmtDNA number per nuclear 
genome in the individual worm analysis. Standard curves for each qPCR primer 
set were generated using purified plasmids individually containing approximately 
1 kb of the mtDNA or nuclear DNA sequence specific for each primer set!*”°. The 
concentration of the purified plasmids was determined using a Nanodrop spec- 
trometer (Thermo Scientific). mtDNA was harvested from synchronized worms 
at the L4 stage. For the experiments involving the glp-4(bn2) strain, worms were 
raised at 25°C and harvested 3 days after hatching. All qPCR results are pre- 
sented as technical replicates, but each experiment has been repeated three or 
more times. A Student’s t-test was employed to determine the level of statistical 
significance. 

RNA isolation and qRT-PCR analysis. Worms were synchronized and raised 
in liquid culture until the L4 stage when they were harvested and compacted 
into pellets on ice. Total RNA was extracted from a 30-50 1l worm pellet using 
RNA STAT (Tel-Test). For the analysis of mtDNA-encoded mRNAs, the RNA 
extracts were then treated with DNase using the DNA-free kit (Ambion) to reduce 
mtDNA contamination. 1 1g of RNA was used for synthesizing cDNA with the 
iScript cDNA Synthesis Kit (Bio-Rad Laboratories). qPCR was performed using 
the Thermo-Scientific SyBr Green Maxima Mix and the MyiQ2 Two-Colour Real- 
Time PCR Detection System (Bio-Rad Laboratories). Primer sequences are listed 
in Supplementary Table 2. All qPCR results are presented as technical replicates, 


but each experiment has been repeated three or more times. A Student's t-test was 
employed to determine the level of statistical significance. 

Statistics. All experiments were performed three times yielding similar results 
and comprised of biological replicates. The sample size and statistical tests were 
chosen based on previous studies with similar methodologies and the data met the 
assumptions for each statistical test performed. No statistical method was used in 
deciding sample sizes. No blinded experiments were performed and randomization 
was not used. For all figures, the mean + standard deviation (s.d.) is represented 
unless otherwise noted. 

Imaging. Whole-worm images were obtained using either a Zeiss AxioCam MRm 
camera mounted on a Zeiss Imager Z2 microscope or a Zeiss M2BIO dissect- 
ing scope. Exposure times were the same within each experiment. Fluorescent 
(ges-Ip,::gfp"™) mitochondrial morphology images of the most proximal intestinal 
cells were taken with a Nikon Eclipse Ti confocal microscope. The signal intensity 
of each image was adjusted to highlight the differences in mitochondrial morphol- 
ogy as opposed to mitochondrial density. All images are representative of more 
than three images. 

Electron microscopy. Animals were prepared for electron microscopy using 
standard methods’’. Ultrathin serial sections (80 nm) were collected using a Leica 
Ultracut UCT Ultramicrotome. Sections at two levels, 100 1m and 110\1m away 
from the head region, for each genotype were examined. Electron microscopy 
images were acquired using a FEI Tecnai G2 Spirit BioTwin transmission electron 
microscope operating at 120 kV with a Gatan 4K x 4K digital camera. 
Microarray analysis. Wild-type (N2) and atfs-1(et18) worms were synchronized by 
bleaching, and harvested at the L4 stage of development. Total RNA was extracted 
using the RNA STAT reagent (Tel-Test) and used for double-stranded cDNA syn- 
thesis using the iScript cDNA Synthesis Kit (Bio-Rad Laboratories). Microarray 
analysis using GeneChip C. elegans genome arrays (Affymetrix) was conducted 
as previously described‘. Differences in gene expression between wild-type and 
atfs-1(et18) worms was determined using Anova streamlined (Partek Genomic 
Suite (v6.5)). Supplementary Table 1 contains the relative fold induction and 
P values for each mRNA. Confirmation of the microarray results was performed 
via qRT-PCR as described‘. Primers are listed in Supplementary Table 2. 
Oxygen consumption. Oxygen consumption was measured using a Seahorse 
XF°96 Analyzer at 25°C similar to that described previously”®. In brief, adult worms 
were transferred onto empty plates and allowed to completely digest the remaining 
bacteria for 1h, after which four worms were transferred into each well of a 96-well 
microplate containing 20011 M9 buffer. Basal respiration was measured for a total 
of 60 min, in 6-min intervals that included a 2 min mix, a 2 min time delay and a 
2 min measurement. To measure respiratory capacity, 15 4M FCCP was injected, 
the OCR (oxygen consumption rate) reading was allowed to stabilize for 15 min 
then measured for five consecutive intervals. Each measurement was considered 
one technical replicate. 


26. He, L. et al. Detection and quantification of mitochondrial DNA deletions in 
individual cells by real-time PCR. Nucleic Acids Res. 30, e68 (2002). 

27. Bargmann, C. |., Hartwieg, E. & Horvitz, H. R. Odorant-selective genes and 
neurons mediate olfaction in C. elegans. Cell 74, 515-527 (1993). 

28. Andreux, P. A. et al. A method to identify and validate mitochondrial 
modulators using mammalian cells and the worm C. elegans. Sci. Rep. 4, 
5285 (2014). 
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Extended Data Figure 1 | RNAi of nuclear encoded OXPHOS AmtDNA quantification as determined by qPCR in AmtDNA worms 


components activates the UPR™, and UPR™ signalling components 
are required for AmtDNA propagation. a, Images of hsp-6,,::gfp worms 
raised on nuo-2 (complex I), cyc-1 (cytochrome c) or atp-2 (complex V) 
RNAi. Scale bar, 0.1 mm. b, Relative AmtDNA quantification as 
determined by qPCR in AmtDNA or atfs-1(tm4525); AmtDNA worms. 

n = 3; error bars, mean +s.d.; *P < 0.03 (Student’s t-test). c, Relative 


raised on control, dve-1, or ubl-5 RNAi. n = 3; error bars, mean +s.d.; 

*P < 0.03 (Student's t-test). d, Images of dve-1,,::dve-1::gfp in wild-type or 
AmtDNA worms. Scale bar, 0.1 mm. e, Ratios of AmtDNA and nuclear 
genomic DNA as determined by qPCR in individual AmtDNA and 
atfs-1(tm4919); mtDNA worms. Black bars represent the mean. n= 16; 
mean P < 0.0001 (Student’s t-test). 
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Extended Data Figure 2 | The reduction in AmtDNA caused by 
atfs-1(RNAi) is largely independent of autophagy, and ATFS-1 
activation in the presence of a deleterious mtDNA is harmful. 

a, Relative AmtDNA quantification as determined by qPCR in AmtDNA 
and atg-18(gk378); AmtDNA worms raised on control or atfs-1 RNAi. 
n= 3; error bars, mean +s.d.; *P < 0.03 (Student's t-test). b, Images of 
hsp-6pr::gfp worms raised on control or spg-7 RNAi. Scale bar, 0.1 mm. 

c, Images of TMRE-stained wild-type, AmtDNA, atfs-1(et18) and atfs-1 
(et18);AmtDNA worms. Scale bar, 0.1 mm. d, Images of synchronized 
atfs-1(et18) and atfs-1(et18); AmtDNA worms raised on 2.5 1M rotenone 
4 days after hatching. Scale bar, 1 mm. 
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Extended Data Figure 3 | polg-1, hmg-5 and drp-1 mRNAs are induced 
in atfs-1(et18) worms, and mitochondrial fusion is required for the 
development of atfs-1(et18) worms harbouring AmtDNA. a, polg-1, 
hmg-5 and drp-1 transcripts as determined by qRT-PCR in wild-type and 
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t-test). RFU, random fluorescent units. b, Images of synchronized 
AmtDNA, atfs-1(et18), and atfs-1(et18); AmtDNA worms raised on 
control or fzo-1 RNAi. Scale bar, 1mm. c, AmtDNA quantification as 
determined by qPCR in AmtDNA worms raised on control or hmg-5 
RNAi. n = 3; error bars, mean +s.d.; *P < 0.03 (Student’s t-test). 
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Programmable editing of a target base in genomic 
DNA without double-stranded DNA cleavage 


Alexis C. Komor!?, Yongjoo B. Kim!?, Michael S. Packer, John A. Zuris!* & David R. Liu! 


Current genome-editing technologies introduce double- 
stranded (ds) DNA breaks at a target locus as the first step to gene 
correction’. Although most genetic diseases arise from point 
mutations, current approaches to point mutation correction are 
inefficient and typically induce an abundance of random insertions 
and deletions (indels) at the target locus resulting from the cellular 
response to dsDNA breaks’. Here we report the development of 
‘base editing, a new approach to genome editing that enables the 
direct, irreversible conversion of one target DNA base into another 
in a programmable manner, without requiring dsDNA backbone 
cleavage or a donor template. We engineered fusions of CRISPR/ 
Cas9 and a cytidine deaminase enzyme that retain the ability to 
be programmed with a guide RNA, do not induce dsDNA breaks, 
and mediate the direct conversion of cytidine to uridine, thereby 
effecting a CT (or G—A) substitution. The resulting “base 
editors’ convert cytidines within a window of approximately five 
nucleotides, and can efficiently correct a variety of point mutations 
relevant to human disease. In four transformed human and murine 
cell lines, second- and third-generation base editors that fuse uracil 
glycosylase inhibitor, and that use a Cas9 nickase targeting the 
non-edited strand, manipulate the cellular DNA repair response 
to favour desired base-editing outcomes, resulting in permanent 
correction of ~15-75% of total cellular DNA with minimal 
(typically <1%) indel formation. Base editing expands the scope 
and efficiency of genome editing of point mutations. 

The clustered regularly interspaced short palindromic repeat 
(CRISPR) system has been widely used to mediate genome editing 
in a variety of organisms and cell lines*. CRISPR/Cas9 protein-RNA 
complexes localize to a target DNA sequence through base pairing with 
a guide RNA, and natively create a dsDNA break (DSB) at the locus 
specified by the guide RNA. In response to DSBs, cellular DNA repair 
processes mostly result in random insertions or deletions (indels) at the 
site of DNA cleavage through non-homologous end joining (NHEJ). In 
the presence of a homologous DNA template, the DNA surrounding the 
cleavage site can be replaced through homology-directed repair (HDR). 
HDR competes with NHEJ during the resolution of DSBs, and indels 
are generally more abundant outcomes than gene replacement. For 
most known genetic diseases, however, correction of a point mutation 
in the target locus, rather than stochastic disruption of the gene, is 
needed to study or address the underlying cause of the disease’. 

Motivated by this need, researchers have sought to increase the 
efficiency of HDR and suppress NHEJ. Despite recent progress 
(see Supplementary Information for a detailed discussion), current 
strategies to correct point mutations using HDR under therapeutically 
relevant conditions remain inefficient (typically ~0.1 to 5%)>°, espe- 
cially in unmodified, non-dividing cells. These observations highlight 
the need to develop alternative approaches to correct point mutations 
in genomic DNA that do not require DSBs. 

We envisioned that direct conversion of one DNA base to another 
at a programmable target locus without requiring DSBs could increase 


the efficiency of gene correction relative to HDR without introduc- 
ing an excess of random indels. Catalytically dead Cas9 (dCas9), 
which contains Asp10Ala and His840Ala mutations that inactivate 
its nuclease activity, retains its ability to bind DNA in a guide RNA- 
programmed manner, but does not cleave the DNA backbone’. In 
principle, conjugation of dCas9 with an enzymatic or chemical catalyst 
that mediates the direct conversion of one base to another could enable 
RNA-programmed DNA base editing. 

The deamination of cytosine (C) is catalysed by cytidine deam- 
inases® and results in uracil (U), which has the base-pairing prop- 
erties of thymine (T). Most known cytidine deaminases operate on 
RNA, and the few examples that are known to accept DNA require 
single-stranded (ss) DNA’. Recent studies on the dCas9-target DNA 
complex reveal that at least nine nucleotides (nt) of the displaced 
DNA strand are unpaired upon formation of the Cas9-guide RNA- 
DNA ‘R-loop complex'®. Indeed, in the structure of the Cas9 R-loop 
complex, the first 11 nt of the protospacer on the displaced DNA 
strand are disordered, suggesting that their movement is not highly 
restricted". It has also been speculated that Cas9 nickase-induced 
mutations at cytosines in the non-template strand might arise from 
their accessibility by cellular cytosine deaminase enzymes'*. We rea- 
soned that a subset of this stretch of ssDNA in the R-loop might serve 
as an efficient substrate for a dCas9-tethered cytidine deaminase to 
effect direct, programmable conversion of C to U in DNA (Fig. 1a). 

Four different cytidine deaminase enzymes (human AID, human 
APOBEC3G, rat APOBEC1, and lamprey CDA1) were evaluated for 
ssDNA deamination. Of the four enzymes, rat APOBEC1 showed the 
highest deaminase activity under the conditions tested (Extended Data 
Fig. 1a). Fusing rat APOBECI1 to the amino terminus, but not the 
carboxy terminus, of dCas9 preserves deaminase activity (Extended 
Data Fig. 1a). We expressed and purified four rat APOBEC1-dCas9 
fusions with linkers of different length and composition (Extended 
Data Fig. 1b), and evaluated each fusion for single guide RNA 
(sgRNA)-programmed dsDNA deamination in vitro (Fig. 1b and 
Extended Data Fig. 1c-f). 

We observed efficient, sequence-specific, sgRNA-dependent C to 
U conversion in vitro (Fig. 1c). Conversion efficiency was greatest 
using rat APOBEC1-dCas9 linkers over nine amino acids in length. 
The number of positions susceptible to deamination (the ‘activity 
window ) increases from approximately 3 to 6nt as the linker length 
was extended from 3 to 21 amino acids (Extended Data Fig. 1c-f). 
The 16-residue XTEN linker’? offered a promising balance between 
these two characteristics, with an efficient deamination window of 
approximately 5 nt, typically from positions 4 to 8 within the pro- 
tospacer, counting the end distal to the protospacer-adjacent motif 
(PAM) as position 1. The rat APOBEC1-XTEN-dCas9 protein served 
as the first-generation base editor (BE1). 

We assessed the ability of BE1 in vitro to correct seven T—C muta- 
tions relevant to human disease (Extended Data Fig. 2). BE1 yielded 
products consistent with efficient editing of the target C, or of at least 
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Figure 1 | First-generation base editor (BE1) mediates specific, guide 
RNA-programmed C—+U conversion in vitro. a, Base editing strategy. 
DNA with a target C (red) at a locus specified by a guide RNA (green) 
is bound by dCas9 (blue), which mediates local DNA strand separation. 
Cytidine deamination by a tethered APOBEC1 enzyme (red) converts 
the single-stranded target CU. The resulting G:U heteroduplex can be 
permanently converted to an A:T base pair following DNA replication 
or DNA repair. b, Deamination assay showing a BE] activity window of 
approximately five nucleotides. Samples were prepared as described in 
the Methods. Each lane is labelled according to the position of the target 
C within the protospace, or with ‘-’ if no target C is present, counting the 
base distal from the PAM as position 1. c, Deamination assay showing the 
sequence specificity and sgRNA-dependence of BE1. The DNA substrate 
with C at position 7 in b was incubated with BE1 and the correct sgRNA, 
a mismatched sgRNA or no sgRNA. The positive control sample used a 
synthetic DNA substrate with a U at position 7. For uncropped gel data, 
see Supplementary Figure 1. 


one C within the activity window when multiple Cs were present, in 
six of these seven targets in vitro, with an average apparent editing 
efficiency of 44% (Extended Data Fig. 2). 

Although the preferred sequence context for APOBEC1 substrates 
is TC or CC, we anticipated that the increased effective molarity of 
the tethered deaminase and its ssDNA substrate upon dCas9 binding 
might relax this preference. To illuminate the context dependence of 
BEI, we assayed its ability to edit a dsDNA 60-mer containing a single 
fixed C at position 7 within the protospacer, as well as all 36 single- 
mutant variants in which protospacer bases 1-6 and 8-13 were indi- 
vidually varied to each of the other three bases. High-throughput DNA 
sequencing revealed 50-80% C to U conversion of substrate strands 
(25-40% of sequence reads from both DNA strands, one of which is 
not a substrate for BE1) (Fig. 2a). Editing efficiency was independ- 
ent of sequence context, unless the base immediately 5’ of the target 
C was a G, in which case editing efficiency was substantially lower 
(Fig. 2a). Next we assessed BE] activity in vitro on all four NC motifs 
at positions 1 through 8 within the protospacer (Fig. 2b). BE1 activity 
followed the order TC > CC > AC > GC (the second nucleotide (C) 
is the target nucleotide), with maximum editing efficiency achieved 
when the target C is at or near position 7 (see Supplementary 
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Figure 2 | Effects of sequence context and target C position on base 
editing efficiency in vitro. a, Effect of changing the sequence surrounding 
the target C on editing efficiency in vitro. The deamination yield of 80% 
of targeted strands (40% of total sequencing reads from both strands) for 
C, in the protospacer sequence 5’-T TATTT(C7)GTGGATTTATTTA-3’ 
was defined as 1.0, and the relative deamination efficiencies of substrates 
containing all possible single-base mutations at positions 1-6 and 8-13 
are shown. b, Positional effect of each NC motif on editing efficiency 

in vitro. Each NC target motif was varied from positions 1 to 8 within the 
protospacer as indicated in the sequences shown on the right. The PAM is 
shown in blue. The graph shows the percentage of total DNA sequencing 
reads containing T at each of the numbered target C positions following 
incubation with BE1. Note that the maximum possible deamination yield 
in vitro is 50% of total sequencing reads (100% of targeted strands). Values 
and error bars reflect the mean and standard deviation of three (a) or 

two (b) independent biological replicates performed on different days. 


Information). In addition, we observed that the base editor is proces- 
sive, and will efficiently convert most or all Cs to Us on the same DNA 
strand within the five-base activity window (Extended Data Fig. 3). 

While BE]1 efficiently processes substrates in a test tube, in cells, a 
variety of possible DNA repair outcomes determines the fate of the 
initial U:G product of base editing (Fig. 3a). We tested the ability of 
BE1 to convert CT in human cells on 14 Cs in six well-studied tar- 
get sites in the human genome (see Supplementary Information and 
Extended Data Fig. 4a)'°. Although C—T editing in cells was observed 
for all cases, the efficiency of base editing was 0.8% to 7.7% of total 
DNA sequences, a large 5- to 36-fold decrease in efficiency compared 
to that of in vitro base editing (Fig. 3b and Extended Data Fig. 4). 

We hypothesized that the cellular DNA repair response to U:G 
heteroduplex DNA was responsible for the large decrease in base edit- 
ing efficiency in cells (Fig. 3a). Uracil DNA glycosylase (UDG) cataly- 
ses removal of U from DNA in cells and initiates base-excision repair 
(BER), with reversion of the U:G pair to a C:G pair as the most common 
outcome (Fig. 3a)’®. Uracil DNA glycosylase inhibitor (UGI), an 
83-residue protein from Bacillus subtilis bacteriophage PBS1, potently 
blocks human UDG activity (ICs) = 12 pM)”’. In an effort to subvert 
BER at the site of base editing, we fused UGI to the C terminus of BE1 
to create a second-generation base editor (BE2, APOBEC-XTEN- 
dCas9-UGI) and repeated editing assays on all six genomic loci. 
Editing efficiencies in human cells were on average threefold higher 
with BE2 than BEI, resulting in gene conversion efficiencies of up to 
20% of total DNA sequenced (Fig. 3b). 

Importantly, BE] and BE2 resulted in indel formation rates < 0.1% 
(Fig. 3c and Extended Data Table 1), consistent with the known 
mechanistic dependence of NHEJ on DSBs (see Supplementary 
Information)'*. We assessed BE2-mediated base editing efficiencies on 
the same genomic targets in U2OS cells, and observed results similar 
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to those in HEK293T cells (Extended Data Fig. 5). Together, these 
results indicate that conjugating UGI to BE] can increase the efficiency 
of base editing in human cells. 

Converting and protecting the substrate strand of a C:G base pair 
(bp) results in a maximum base editing yield of 50%. To augment base 
editing efficiency beyond this limit, we sought to further manipulate 
cellular DNA repair to induce correction of the non-edited strand 
containing the G. Eukaryotic mismatch repair (MMR) uses nicks 
present in newly synthesized DNA to direct removal and resynthe- 
sis of the newly synthesized strand (Fig. 3a)'”?°. We reasoned that 
nicking the DNA strand containing the unedited G would simulate 
newly synthesized DNA, inducing MMR, or simulate damaged DNA, 
inducing long-patch BER, to preferentially resolve the U:G mismatch 
into desired U:A and T:A products (Fig. 3a). We therefore restored 
the catalytic His residue at position 840 in the Cas? HNH domain 
of BE2 (ref. 7), resulting in the third-generation base editor (BE3, 
APOBEC-XTEN-dCas9(A840H)-—UGI) that nicks the non-edited 
strand containing a G opposite the edited U. BE3 retains the Asp10Ala 
mutation in Cas9 that prevents dsDNA cleavage, and also retains UGI 
to suppress UDG- initiated BER of the editing strand. 

Nicking the non-edited strand augmented base editing efficiency in 
human cells treated with BE3 by an additional two- to sixfold relative 
to BE2, resulting in up to 37% of total DNA sequences containing 
the targeted CT conversion (Fig. 3b). Importantly, only a small fre- 
quency of indels, averaging 1.1% for the six tested loci, was observed 
from BE3 treatment (Fig. 3c and Extended Data Table 1). In contrast, 
when we treated cells with wild-type Cas9, sgRNA to target each of 
three loci, and a 200 nt ssDNA donor template to mediate HDR, we 
observed C—+T conversion efficiencies averaging only 0.5%, with 
much higher indel formation averaging 4.3% (Fig. 3c). The ratio of 
allele conversion to NHEJ outcomes averaged >1,000 for BE2, 23 for 
BE3, and 0.17 for wild-type Cas9 (Fig. 3c). We confirmed the per- 
manence of base editing in human cells by monitoring editing effi- 
ciencies over multiple cell divisions in HEK293T cells at the HEK293 
site 3 and 4 genomic loci (Extended Data Fig. 6 and Supplementary 
Information). These results collectively establish that base editing can 
induce much more efficient targeted single-base editing in human cells 
than Cas9-mediated HDR, and with substantially less (BE3) or almost 
no (BE2) indel formation. 

Next, we examined the off-target activity of BE1, BE2, and BE3 
in human cells for five previously studied loci (see Supplementary 
Information and Supplementary Tables 1-5). Because the sequence 
preference of rat APOBEC1 is known to be independent of bases 
more than one nucleotide from the target C (ref. 21), consistent with 
Fig. 2a, we assumed that off-target base editing arises from off- 
target Cas9 binding. Therefore, we sequenced the top 34 known 
Cas9 off-target sites in human cells'’, and the top 12 known dCas9 
off-target binding sites for the six genomic loci studied in Fig. 3 
(Supplementary Tables 1-5)?*. We observed detectable off-target 
base editing at a subset of known Cas9 off-target sites (16/34 for BE1 
and BE2; and 17/34 for BE3), but no detectable base editing at the 
known dCas9 off-target sites. All detected off-target base-editing 
substrates contained a C within the five-base activity window (see 
Supplementary Information). We also monitored CT mutations at 
3,200 cytosines surrounding the six on-target and 44 off-target loci 
tested and observed no evident increase in CT conversions outside 
the protospacer upon BE1, BE2, or BE3 treatment compared to that of 
untreated cells (Extended Data Fig. 7). Taken together, these findings 
suggest that off-target substrates of base editors include a subset of 
Cas9 off-target substrates, and that base editors in human cells do not 
induce untargeted C—T conversion throughout the genome. 

Finally, we tested the potential of base editing to correct two 
disease-relevant mutations in mammalian cells. The apolipoprotein 
E gene variant APOE4 encodes two arginine residues at amino acid 
positions 112 and 158, and is the largest and most common genetic 
risk factor for late-onset Alzheimer’s disease’. ApoE variants with Cys 
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Figure 3 | Base editing in human cells. a, Possible base editing outcomes 
in mammalian cells. Initial editing results in a U:G mismatch. Recognition 
and excision of the U by uracil DNA glycosylase (UDG) initiates base 
excision repair (BER), which leads to reversion to the C:G starting state. 
BER is impeded by BE2 and BE3, which inhibit UDG. The U:G mismatch 
is also processed by mismatch repair (MMR), which preferentially 

repairs the nicked strand of a mismatch. BE3 nicks the non-edited strand 
containing the G, favouring resolution of the U:G mismatch to the desired 
U:A or T:A outcome. b, HEK293T cells were treated as described in the 
Methods. The percentage of total DNA sequencing reads with Ts at the 
target positions indicated are shown for treatment with BE1, BE2, or BE3, 
or for treatment with wild-type Cas9 with a 200-nt donor HDR template. 
c, Frequency of indel formation (see Methods) is shown following the 
treatment in b. Values are listed in Extended Data Table 1. For b and c, 
values and error bars reflect the mean and s.d. of three independent 
biological replicates performed on different days. 


residues at these positions, including APOE2 (Cys112 and Cys158), 
APOE3 (Cys112 and Arg158), and APOE3r (Arg112 and Cys158) have 
been shown or are presumed” to confer lower Alzheimer’s disease 
risk than APOE4. We attempted to convert APOEF4 into APOE3r in 
immortalized mouse astrocytes in which the endogenous Apoe gene 
was replaced by human APOE4. We delivered into these astrocytes by 
nucleofection DNA encoding BE3 and an appropriate sgRNA placing 
the target C at position 5 relative to a downstream PAM. After two 
days, we isolated nucleofected cells and measured editing efficiency by 
HTS of genomic DNA. We observed conversion of Arg158 to Cys158 
in 58-75% of total DNA sequencing reads (Fig. 4a and Extended Data 
Fig. 8a). We also observed 36-50% editing of total DNA at the third 
position of codon 158 and 38-55% editing of total DNA at the first 
position of Leu159, as expected since all three of these Cs are within 
the base editing window. Neither of the other two C—T conversions, 
however, alters the amino acid sequence of the ApoE3r protein, as both 
TGC and TGT encode Cys (all CT changes at the third position ofa 
codon are silent), and both CTG and TTG encode Leu. 
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a Untreated Lys Arg Leu Ala Val Tyr Gin Indel % 
APog4acissR | G | AlTAT] Gi ol] a@{[cf{[c][t]a{]ajf[tc{iTa;la]TtT{[_GaG/{][TtT]TaAfTcf{c]al]a la 0.0 
A 0.0 100.0 100.0 00 00 00 00 00 00 00 00 00 1000 00 00 00 00 100.0 00 0.0 100.0 0.0 0.0 
Cc 0.060.000 «€6—00.0 0.0 RRR] 0.0 BRR «0.0 O00 0.0 BR 00 00 O00 OO O00 0.0 i 00 O00 0.0 
G 100.0 0.00 0.0 100.0 0.00 1000 0.0 00 00 100.0 1000 00 00 100.0 00 1000 00 00 00 00 0.0 99.9 100.0 
T 00 00 00 00 00 00 00 00 1000 00 00 00 00 0.0 100.00 00 1000 00 00 00 00 00 0.0 
BES treated Lys Arg > Cys Leu > Leu Ala Val Tyr Gin Indel % 
APOE4C158R | G | A | A | G[CG |G [ c/c][T]a@{|ajf[cjfTala]TtT{[_a[T]aA]Tc]c]f]a]a]|] Ga] 4e 
A 0.1 BR C—0.0 Ul0hCl0 dS 609) 600 )6ClsCis—i. a. CiCisti‘i ik SR O.O 8 Cl. S.StsiéOO 
Cc 0.0 00 O00 O00 23.7 0.0 474 435 00 00 00 999 00 00 00 00 00 0.0 100.0 1000 00 00 0.0 
G 99.9 0.0 0.0 100.0 09 99.9 ) 0.0 100.0 100.0 00 0.0 1000 0.0 1000 00 00 00 00 0.0 100.0 99.9 
T 0.0 0.0 0.0 0.0 (RR 0.1 550 1000 00 00 O01 00 00 1000 0.0 1000 00 00 00 00 00 00 
Cas9 + HDR Lys Arg > Cys Leu Ala Val Tyr Gin Indel % 
APog4cissR | G | ATA] Gi[ol]a@{[cfcfTt]@{]af[cjTa;la]tT{[_a[{[T]aATc]cj]a] af Ga 261 
A 0.0 100.0 1000 00 00 00 00 00 00 00 00 00 994 00 00 00 0.0 100.0 00 0.0 100.0 0.0 0.0 
Cc 00 00 00 00 99.7 00 999 999 00 00 00 1000 05 00 00 00 00 0.0 100.0 100.0 00 00 £O.0 
G 100.0 0.00 0.0 100.0 0.00 1000 00 00 00 1000 999 00 00 996 06 999 02 00 00 0.0 0.0 100.0 100.0 
T 00 00 00 00 03 00 01 O71 1000 00 00 00 01 O04 993 01 998 00 00 00 00 00 0.0 


b Untreated 


Po Arg | Ala Sf Met S| A Se CS 


Cys 


Indel % 


C 


TP53 Y163C 0.0 
A 00.0600 )6€©6000 00 00 00 0.0 Bi 00 00 O01 00 0.0 BR 00 00 OO O00 0.0 EIRRRIREI 00 0.0 
Cc 100.0 100.0 0.0 100.0 0.0 100.0 100.00 00 00 0.0 0.0 100.0 1000 00 00 1000 00 00 1000 00 00 0.0 100.0 
G 0.0 0.0 —RR 0.0 He 00 O00 OO 0.0 SEE 00 00 00 00 00 00 Ea 00 0.0 0.0 —RR 0.0 
t 0.0 00 00 00 00 00 00 00 1000 00 00 00 00 0.0 1000 00 1000 00 00 00 00 00 0.0 
BES treated Arg Ala Met Ala lle Cys > Tyr Lys Indel % 
Tas yiesc [Cc [cc] aGa]c[a]cftTc;]altTtTiy]@{; al cj[c/;al;lt{[_c{[ tT] [/q]c]|a]fla]eal|ec 0.7 
A 00 O00 O00 O00 O1 OO O00 1000 00 00 01 O00 O00 1000 00 00 00 76 0.0 100.0 100.0 0.0 0.0 
Cc 100.0 100.0 0.0 100.0 0.0 100.0 100.0 00 00 00 00 999 100.0 0.0 00 1000 00 04 100.0 0.0 0.0 0.0 100.0 
G 0.0 0.0 ERR 0.0 He 00 «#00 OO 0.0 SERRE 00 00 00 00 00 0.0 —RRN 0.0 O.0 0.0 (RR 0.0 
tT 0.00600006«©6000:«©600.0 «600 06«60006«6(0.0)S (0.0 RR CO. SC 0CiTsCi.ts«=i 0s i. CC CisiCitsé 
Cas9 + HDR Arg Ala Met Ala lle Cys > Tyr Lys Indel % 
Tesyiesc [CC | Cc] G][c[a@][c]c;]alTtT{i{a/]ajlocolTcj{l, al;lTtT{[_c]/Tt[{[a@]c/]alalal cy] 61 
A 00 00 O00 00 00 00 0.0 1000 00 00 00 00 00 1000 00 00 00 00 00 99.9 100.0 0.0 0.0 
Cc 100.0 100.0 0.0 100.0 0.0 100.0 100.00 00 00 0.0 0.0 100.0 1000 00 00 1000 00 00 1000 00 00 0.0 100.0 
G 0.0 0.0 100.0 0.0 100.0 00 00 00 00 1000 999 00 00 00 00 00 00 100.0 00 00 00 100.0 0.0 
T 00 00 00 00 00 00 00 00 1000 00 00 00 00 0.0 1000 00 1000 00 00 00 00 00 0.0 


Figure 4 | BE3-mediated correction of two disease-relevant mutations 
in mammalian cells. The sequence of the protospacer is shown to the 
right of the mutation, with the PAM in blue and the target base in red 
with a subscripted number indicating its position within the protospacer. 
Underneath each sequence are the percentages of total sequencing 

reads with the corresponding base. Cells were treated as described in 

the Methods. a, The Alzheimer’s disease-associated APOE4 allele was 
converted to APOE3r in mouse astrocytes by BE3 in 74.9% of total reads. 


The efficiency of BE3-mediated editing of APOE4 demonstrates 
that a combination of suppressing BER and guiding MMR to repair 
the unedited strand enables base editing efficiencies to exceed the 50% 
maximum yield that would result from DNA replication alone. We 
observed no evident increase in mutations within 50 bp of either end 
of the protospacer compared with untreated controls (Supplementary 
Table 6). We observed 4.6-6.1% indels at the targeted locus following 
BE3 treatment. In contrast, identical treatment of astrocytes with wild- 
type Cas9 and donor ssDNA resulted in 0.1-0.3% APOEF4 correction 
and 26-40% indels at the targeted locus, efficiencies consistent with 
previous reports of single-base correction using Cas9 and HDR°® 
(Fig. 4a and Extended Data Fig. 8a). Astrocytes treated identically 
but with an sgRNA targeting the VEGFA locus displayed no evidence 
of APOE4 base editing (Supplementary Table 6 and Extended Data 
Fig. 8a). These results demonstrate that base editors can mediate 
highly efficient and precise single amino acid changes in the cod- 
ing sequence of a protein, even when their processivity results in >1 
nucleotide change in genomic DNA. 

The dominant-negative p53 mutation Tyr163Cys is strongly asso- 
ciated with several types of cancer?’ and can be corrected by a C+T 
conversion on the template strand (Extended Data Fig. 2), resulting 
in the translation of corrected protein even before the edited base is 
made permanent by DNA replication or DNA repair. We nucleofected 
a human breast cancer cell line homozygous for the p53 Tyr163Cys 
mutation (HCC1954 cells) with DNA encoding BE3 and an sgRNA 
programmed to correct Tyr163Cys. We observed correction of the 
Tyr163Cys mutation in 3.3-7.6% of nucleofected HCC1954 cells 
(Fig. 4b, Extended Data Fig. 8b, and Supplementary Table 7), with 
< 0.7% indel formation. In contrast, treatment of cells with wild-type 


Two nearby Cs are also converted to Ts, but with no change to the 
predicted sequence of the resulting protein. Identical treatment of these 
cells with wild-type Cas9 and a 200-nt ssDNA donor results in 0.3% 
correction, with 26.1% indel formation. b, The cancer-associated p53 
Y163C mutation is corrected by BE3 in 7.6% of nucleofected human breast 
cancer cells with 0.7% indel formation. Identical treatment of these cells 
with wild-type Cas9 and donor ssDNA results in no mutation correction 
with 6.1% indel formation. 


Cas9 and donor ssDNA resulted in no detectable TP53 correction 
(<0.1%) with 6.1-8.0% indels at the target locus (Fig. 4b and Extended 
Data Fig. 8b). These results collectively represent the correction of 
disease-associated point mutations in mammalian cell lines with an 
efficiency and lack of other genome modification events that may 
not be achievable using previously described methods. An additional 
300-900 clinically relevant known human genetic diseases that in 
principle are correctable by the base editors described in this work 
are shown in Extended Data Fig. 9 and Supplementary Table 8 (see 
Supplementary Information). 

The development of base editing advances both the scope and 
effectiveness of genome editing. The base editors described here 
offer researchers a choice of editing with very little (<0.1%) indel 
formation (BE2), or more efficient editing with <1% indel formation 
(BE3). That the product of base editing is, by definition, no longer a 
substrate likely contributes to editing efficiency by preventing sub- 
sequent product transformation, which can hamper traditional Cas9 
applications. By removing the reliance on dsDNA cleavage, donor 
templates, and stochastic DNA repair processes that vary by cell state 
and cell type, base editing has the potential to expand the type of 
genome modifications that can be cleanly installed, the efficiency 
of these modifications, and the type of cells that are amenable to 
editing. It is likely that engineered Cas9 variants”®** or delivery 
methods”? that offer improved DNA specificity or altered PAM spe- 
cificities*° can provide additional base editors with improved prop- 
erties. These results also suggest architectures for the fusion of other 
DNA-modifying enzymes, including methylases and demethylases, 
that may enable additional types of programmable genome and epig- 
enome base editing. 
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Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

Cloning. DNA sequences of all substrates and primers used in this paper are listed 
in the Supplementary Information. PCR was performed using VeraSeq ULtra 
DNA polymerase (Enzymatics), or Q5 Hot Start High-Fidelity DNA Polymerase 
(New England Biolabs). BE plasmids were constructed using USER cloning (New 
England Biolabs). Deaminase genes were synthesized as gBlocks Gene Fragments 
(Integrated DNA Technologies), and Cas9 genes were obtained from previously 
reported plasmids*!. Deaminase and fusion genes were cloned into pCMV 
(mammalian codon-optimized) or pET28b (E. coli codon-optimized) backbones. 
sgRNA expression plasmids were constructed using site-directed mutagenesis. 
Briefly, the primers listed in the Supplementary Information were 5’ phospho- 
rylated using T4 Polynucleotide Kinase (New England Biolabs) according to the 
manufacturer's instructions. Next, PCR was performed using Q5 Hot Start High- 
Fidelity Polymerase (New England Biolabs) with the phosphorylated primers and 
the plasmid pFYF1320 (EGFP sgRNA expression plasmid) as a template accord- 
ing to the manufacturer’s instructions. PCR products were incubated with DpnI 
(20 U, New England Biolabs) at 37°C for 1h, purified on a QIAprep spin column 
(Qiagen), and ligated using QuickLigase (New England Biolabs) according to 
the manufacturer’s instructions. DNA vector amplification was carried out using 
Mach1 competent cells (ThermoFisher Scientific). 

In vitro deaminase assay on ssDNA. Sequences of all ssDNA substrates are listed 
in the Supplementary Information. All Cy3-labelled substrates were obtained from 
Integrated DNA Technologies (IDT). Deaminases were expressed in vitro using 
the TNT T7 Quick Coupled Transcription/Translation Kit (Promega) according 
to the manufacturer's instructions using 1 \.g of plasmid. Following protein expres- 
sion, 5 ul of lysate was combined with 35 yl of ssDNA (1.8,1.M) and USER enzyme 
(1 unit) in CutSmart buffer (New England Biolabs) (50mM potassium acetate, 
29 mM Tris-acetate, 10 mM magnesium acetate, 100,1gml~! BSA, pH 7.9) and 
incubated at 37°C for 2h. Cleaved U-containing substrates were resolved from 
full-length unmodified substrates on a 10% TBE-urea gel (Bio-Rad). 

Expression and purification of Hiss-rAPOBEC1-linker-dCas9 fusions. E. coli 
BL21 STAR (DE3)-competent cells (ThermoFisher Scientific) were transformed 
with plasmids encoding pET28b-Hiss-rAPOBEC1-linker-dCas9 with GGS, 
(GGS)3, XTEN, or (GGS)7 linkers. The resulting expression strains were grown 
overnight in Luria-Bertani (LB) broth containing 100,gm1~' of kanamycin at 37 °C. 
The cells were diluted 1:100 into the same growth medium and grown at 37°C to 
OD¢00 = ~0.6. The culture was cooled to 4°C over a period of 2h, and isopropyl- 
B-p-1-thiogalactopyranoside (IPTG) was added at 0.5mM to induce protein 
expression. After ~16h, the cells were collected by centrifugation at 4,000g and 
resuspended in lysis buffer (50 mM tris(hydroxymethyl)-aminomethane (Tris) - 
HCl (pH 7.5), 1M NaCl, 20% glycerol, 10 mM tris(2-carboxyethyl) phosphine 
(TCEP, Soltec Ventures)). The cells were lysed by sonication (20s pulse-on, 20s 
pulse-off for 8 min total at 6 W output) and the lysate supernatant was isolated 
following centrifugation at 25,000g for 15 min. The lysate was incubated with His- 
Pur nickel-nitriloacetic acid (nickel-NTA) resin (ThermoFisher Scientific) at 4°C 
for 1h to capture the His-tagged fusion protein. The resin was transferred to a 
column and washed with 40 ml of lysis buffer. The His-tagged fusion protein was 
eluted in lysis buffer supplemented with 285 mM imidazole, and concentrated by 
ultrafiltration (Amicon-Millipore, 100-kDa molecular weight cut-off) to 1 ml total 
volume. The protein was diluted to 20 ml in low-salt purification buffer contain- 
ing 50 mM tris(hydroxymethyl)-aminomethane (Tris )- HCl (pH 7.0), 0.1 M NaCl, 
20% glycerol, 10 mM TCEP and loaded onto SP Sepharose Fast Flow resin (GE 
Life Sciences). The resin was washed with 40 ml of this low-salt buffer, and the 
protein eluted with 5 ml of activity buffer containing 50 mM tris(hydroxymethy]l)- 
aminomethane (Tris)-HCl (pH 7.0), 0.5 M NaCl, 20% glycerol, 10 mM TCEP. The 
eluted proteins were quantified by SDS-PAGE. 

In vitro transcription of sgRNAs. Linear DNA fragments containing the T7 pro- 
moter followed by the 20-bp sgRNA target sequence were transcribed in vitro using 
the primers listed in the Supplementary Information with the TranscriptAid T7 
High Yield Transcription Kit (ThermoFisher Scientific) according to the man- 
ufacturer’s instructions. sgRNA products were purified using the MEGAclear 
Kit (ThermoFisher Scientific) according to the manufacturer’s instructions and 
quantified by UV absorbance. 

Preparation of Cy3-conjugated dsDNA substrates. Sequences of 80-nt unla- 
belled strands are listed in the Supplementary Information and were ordered as 
PAGE-purified oligonucleotides from IDT. The 25-nt Cy3-labelled primer listed 
in the Supplementary Information is complementary to the 3’ end of each 80-nt 
substrate. This primer was ordered as an HPLC-purified oligonucleotide from IDT. 
To generate the Cy3-labelled dsDNA substrates, the 80-nt strands (5,11 of a 100 1M 
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solution) were combined with the Cy3-labelled primer (5 11 of a 100|1M solution) 
in NEBuffer 2 (38.25 ul of a50 mM NaCl, 10 mM Tris-HCl, 10 mM MgCh, 1mM 
DTT, pH 7.9 solution, New England Biolabs) with dNTPs (0.75 ul of a 100 mM 
solution) and heated to 95°C for 5 min, followed by a gradual cooling to 45°C at 
a rate of 0.1°C per s. After this annealing period, Klenow exo” (5 U, New England 
Biolabs) was added and the reaction was incubated at 37°C for 1h. The solution 
was diluted with buffer PB (25011, Qiagen) and isopropanol (50 yl) and purified 
on a QIAprep spin column (Qiagen), eluting with 5011 of Tris buffer. 
Deaminase assay on dsDNA. The purified fusion protein (20 ul of 1.9\.M in 
activity buffer) was combined with 1 equivalent of appropriate sgRNA and incu- 
bated at ambient temperature for 5 min. The Cy3-labelled dsDNA substrate was 
added to final concentration of 125nM and the resulting solution was incubated 
at 37°C for 2h. The dsDNA was separated from the fusion by the addition of 
buffer PB (100 ul, Qiagen) and isopropanol (25 1) and purified on a EconoSpin 
micro spin column (Epoch Life Science), eluting with 20 tl of CutSmart buffer 
(New England Biolabs). USER enzyme (1 U, New England Biolabs) was added 
to the purified, edited dsDNA and incubated at 37°C for 1h. The Cy3-labelled 
strand was fully denatured from its complement by combining 5 yl of the reaction 
solution with 15 jl of a DMSO-based loading buffer (5 mM Tris, 0.5 mM EDTA, 
12.5% glycerol, 0.02% bromophenol blue, 0.02% xylene cyan, 80% DMSO). The 
full-length C-containing substrate was separated from any cleaved, U-containing 
edited substrates on a 10% TBE-urea gel (Bio-Rad) and imaged on a GE Amersham 
Typhoon imager. 

Preparation of in vitro-edited dsDNA for high-throughput sequencing. The 
oligonucleotides listed in the Supplementary Information were obtained from 
IDT. Complementary sequences were combined (5 ul of a 100 1M solution) in 
Tris buffer and annealed by heating to 95°C for 5 min, followed by a gradual cool- 
ing to 45°C at a rate of 0.1°C per s to generate 60-bp dsDNA substrates. Purified 
fusion protein (2011 of 1.9\.M in activity buffer) was combined with 1 equiva- 
lent of appropriate sgRNA and incubated at ambient temperature for 5 min. The 
60-mer dsDNA substrate was added to final concentration of 125nM and the 
resulting solution was incubated at 37°C for 2h. The dsDNA was separated from 
the fusion by the addition of buffer PB (100,11, Qiagen) and isopropanol (25 11) 
and purified on a EconoSpin micro spin column (Epoch Life Science), eluting 
with 201l of Tris buffer. The resulting edited DNA (1 11 was used as a template) 
was amplified by PCR using the high-throughput sequencing primer pairs speci- 
fied in the Supplementary Information and VeraSeq Ultra (Enzymatics) according 
to the manufacturer’s instructions with 13 cycles of amplification. PCR reaction 
products were purified using RapidTips (Diffinity Genomics), and the purified 
DNA was amplified by PCR with primers containing sequencing adapters, puri- 
fied, and sequenced on a MiSeq high-throughput DNA sequencer (Illumina) as 
previously described’. 

Cell culture. HEK293T (ATCC CRL-3216) and U2OS (ATCC HTB-96) were 
maintained in Dulbecco's Modified Eagle’s Medium plus GlutaMax (ThermoFisher) 
supplemented with 10% (v/v) fetal bovine serum (FBS), at 37°C with 5% CO. 
HCC1954 cells (ATCC CRL-2338) were maintained in RPMI-1640 medium 
(ThermoFisher Scientific) supplemented as described above. Immortalized 
mouse astrocytes containing the APOE4 isoform of the APOE gene (Taconic 
Biosciences) were cultured in Dulbecco's Modified Eagle’s Medium plus GlutaMax 
(ThermoFisher Scientific) supplemented with 10% (v/v) fetal bovine serum (FBS) 
and 200g ml~! Geneticin (ThermoFisher Scientific). 

Transfections. HEK293T cells were seeded on 48-well collagen-coated BioCoat 
plates (Corning) and transfected at approximately 85% confluency. Briefly, 
750 ng of BE and 250 ng of sgRNA expression plasmids were transfected using 
1.5 ul of Lipofectamine 2000 (ThermoFisher Scientific) per well according to the 
manufacturer’s protocol. 

Astrocytes, U2OS, HCC1954 and HEK293T cells were transfected using appro- 
priate Amaxa Nucleofector II programs according to manufacturer's instructions 
(basic glial cell, V, V, and V kits using programs T-020, X-001, X-005, and Q-001 
for astrocytes, U2OS, HCC1954, and HEK293T cells, respectively). 40 ng of 
iRFP670 (Addgene plasmid 45457)°° was added to the nucleofection solution to 
assess nucleofection efficiencies in these cell lines. Astrocytes and HCC1954 cells 
were filtered through a 401m strainer (Fisher Scientific) after harvesting, and the 
nucleofected cells were collected on a Beckman Coulter MoFlo XDP Cell Sorter 
using the iRFP signal (absorbance 643 nm, emission 670 nm). The U2OS and 
HEK293T cells were used without enrichment of nucleofected cells. 
High-throughput DNA sequencing of genomic DNA samples. Transfected cells 
were harvested after 3 days and the genomic DNA was isolated using the Agencourt 
DNAdvance Genomic DNA Isolation Kit (Beckman Coulter) according to the 
manufacturer’s instructions. On-target and off-target genomic regions of interest 
were amplified by PCR with flanking high-throughput sequencing primer pairs 
listed in the Supplementary Information. PCR amplification was carried out with 
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Phusion high-fidelity DNA polymerase (ThermoFisher) according to the manu- 
facturer’s instructions using 5 ng of gnomic DNA as a template. Cycle numbers 
were determined separately for each primer pair as to ensure the reaction was 
stopped in the linear range of amplification (30, 28, 28, 28, 32, and 32 cycles for 
EMX1, FANCF, HEK293 site 2, HEK293 site 3, HEK293 site 4, and RNF2 primers, 
respectively). PCR products were purified using RapidTips (Diffinity Genomics). 
Purified DNA was amplified by PCR with primers containing sequencing adap- 
tors. The products were gel purified and quantified using the Quant-iT PicoGreen 
dsDNA Assay Kit (ThermoFisher) and KAPA Library Quantification Kit-Illumina 
(KAPA Biosystems). Samples were sequenced on an Illumina MiSeq as previously 
described’. 

Data analysis. Sequencing reads were automatically demultiplexed using MiSeq 
Reporter (Illumina), and individual FASTQ files were analysed with a custom 
Matlab script provided in the Supplementary Information. Each read was pair- 
wise aligned to the appropriate reference sequence using the Smith-Waterman 
algorithm. Base calls with a Q-score below 31 were replaced with Ns and were thus 
excluded in calculating nucleotide frequencies. This treatment yields an expected 
MiSeq base-calling error rate of approximately 1 in 1,000. Aligned sequences in 


which the read and reference sequence contained no gaps were stored in an align- 
ment table from which base frequencies could be tabulated for each locus. 

Indel frequencies were quantified with a custom Matlab script shown in the 
Supplementary Information using previously described criteria”®. Sequencing 
reads were scanned for exact matches to two 10-bp sequences that flank both sides 
of a window in which indels might occur. Ifno exact matches were located, the read 
was excluded from analysis. If the length of this indel window exactly matched the 
reference sequence the read was classified as not containing an indel. If the indel 
window was two or more bases longer or shorter than the reference sequence, 
then the sequencing read was classified as an insertion or deletion, respectively. 


31. Mali, P. et al. CAS9 transcriptional activators for target specificity screening and 
paired nickases for cooperative genome engineering. Nature Biotechnol. 31, 
833-838 (2013). 

32. Pattanayak, V. et a/. High-throughput profiling of off-target DNA cleavage 
reveals RNA-programmed Cas9 nuclease specificity. Nature Biotechnol. 31, 
839-843 (2013). 

33. Shcherbakova, D. M. & Verkhusha, V. V. Near-infrared fluorescent proteins for 
multicolor in vivo imaging. Nature Methods 10, 751-754 (2013). 


© 2016 Macmillan Publishers Limited. All rights reserved 


a > = 
2» & 9 
« & wo 
S & PR 
1 (ep) xt 
Y) rs 
0 9 & § 
- Q + ®D Bs 
O i tu O O Oo e) 
uw x a uw uw > ‘ 
a oO oz) jaa) jaa) (eo) S 
O Oo a 5S Oo Oo O 8 € 
Oo. = O ial Oo. oO. {a} oO 
< & € 2 82 €¢€ € 8 8B 
C-containing 
i 
Go quures Boned = aa substrates 
cleaved 
— @ —s——-__ U-containing 
products 


Extended Data Figure 1 | Effects of deaminase, linker length, and linker 
composition on base editing. a, Gel-based deaminase assay showing 
activity of rat APOBEC1 (rAPOBEC1), lamprey CDA1 (pmCDA1), 
human AID (hAID), human APOBEC3G (hAPOBEC3G), rAPOBEC1- 
GGS-dCas9, rAPOBEC1-(GGS)3-dCas9, and dCas9-(GGS)3-rAPOBEC1 
on ssDNA. Enzymes were expressed in a mammalian cell lysate-derived 
in vitro transcription-translation system and incubated with 1.8 14M 
dye-conjugated ssDNA and USER enzyme (uracil DNA glycosylase and 
endonuclease VIII) at 37°C for 2h. The resulting DNA was resolved on 
a denaturing polyacrylamide gel and imaged. The positive control is a 
sequence with a U synthetically incorporated at the same position as the 
target C. b, Coomassie-stained denaturing PAGE of the expressed and 
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purified proteins used in c-f. c-f, Gel-based deaminase assay showing the 
deamination window of base editors with deaminase-Cas9 linkers of GGS 
(c), (GGS)3 (d), XTEN (e), or (GGS), (f). Following incubation of 1.85 1M 
deaminase-dCas9 fusions complexed with sgRNA with 125 nM dsDNA 
substrates at 37°C for 2h, the dye-conjugated DNA was isolated and 
incubated with USER enzyme at 37°C for 1h to cleave the DNA backbone 
at the site of any Us. The resulting DNA was resolved on a denaturing 
polyacrylamide gel, and the dye-conjugated strand was imaged. Each 

lane is numbered according to the position of the target C within the 
protospacer, or labelled with ‘—’ if no target C is present. 8U is a positive 
control sequence with a U synthetically incorporated at position 8. 

For uncropped gel data, see Supplementary Fig. 1. 
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a 
APOE4 Cys112Arg: 5'-GGAGGACGTGC,,GCGGCCGCCTGG 
APOE4 Cys158Arg: 9-GAAGC;,GCCT GGCAGTGTACCAGG 
CTNNB1 Thr41Ala: 5’-CTGTGGC,AGT GGCACCAGAAT GG 
HRAS GIn61Arg: 5’-CCTCCC,GGCCGGCGGTATCCAGG 
P53 Tyr163Cys: 9 -GCTTGC,AGAT GGCCATGGCGCGG 
P53 Tyr236Cys: 9 -ACACATGC,AGTT GTAGTGGAT GG 
b P53 Asn239Asp: 5-TGTC,ACACATGTAGTTGTAGT GG 
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Extended Data Figure 2 | BE] is capable of correcting disease-relevant 
mutations in vitro. a, Protospacer and PAM sequences (blue) of seven 
disease-relevant mutations. The disease-associated target C in each case 
is indicated with a subscripted number reflecting its position within the 
protospacer. For all mutations except both APOE4 SNPs, the target C 
resides in the template (non-coding) strand. b, Deaminase assay showing 
each dsDNA 80-mer oligonucleotide before (—) and after (+) incubation 
with BE1, DNA isolation, and incubation with USER enzymes to cleave 


DNA at positions containing U. Positive control lanes from incubation 
of synthetic oligonucleotides containing U at various positions within 
the protospacer with USER enzymes are shown with the corresponding 
number indicating the position of the U. Editing efficiencies were 
quantitated by dividing the intensity of the cleaved product band by 
that of the entire lane for each sample. For uncropped gel data, 

see Supplementary Fig. 1. 
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protospacer and PAM sequence: 5'-TTCCCCCCCCGATTTAT TTATGG-3’ 


sequence % of total reads 

wha... 62.4 

wo VED) 0 tucheees. 18.2 

FT TTTITTIC... 13.4 

oT TUTI I Ds 3.3 

was T Agpherharleg orl er--. 0.8 

..CCCCTTCC... 0.3 

soltetater | TT Leelee: 0.3 

oct LET TGkatee:. 0.3 

_.GCCCTCGE... 0.3 
Extended Data Figure 3 | Processivity of BE1. The protospacer and PAM _ throughput sequencing. Shown are the percent of total reads for the most 
(blue) of a 60-mer DNA oligonucleotide containing eight consecutive Cs frequent nine sequences observed. The vast majority of edited strands 
is shown at the top. The oligonucleotide (125 nM) was incubated with (>93%) have more than one C converted to T. 


BEI (21M) for 2h at 37°C. The DNA was isolated and analysed by high- 
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a 
EMX1: GAGTC,C,;GAGCAGAAGAAGAAGGG 
FANCEF: GGAATE,O: TI TGCAGCACCTGG 
HEK293 site 2: GAAC,AC,AAAGCATAGACTGCGGG 
HEK293 site 3: GGCC,C;,;AGACTGAGCACGTGATGG 
HEK293 site 4: GGCAC.TGCGGCTGGAGGTCCGGG 
' RNF2: GTC,ATC,TTAGTC,,ATTACCTGAGG 


EMX1 G A | 6] Tf CG: | Ce | G | A! G C A | G A A A A G [AIA G |G/cG] 
1.85uMBE1 |A]| 0.2 0.1 0.1 0.1 0.0 03 0.1 0.1 0.2 0.2 0.1 O02 02 
0.0 O00 O00 O00 604 6140 00 00 00 991 00 0.0 0.0 0.2 00 00 00 00 20 00 00 0.0 


|c| 0. 
|G|99.7 00 999 00 00 01 00 997 00 00 | 997 0.0 
[T] 01 0.0 0.0 39.5 07 00 00 00 00 01 400 


C7 Cg T T Ci T G Cc A G 
0.1 0.1 0.0 0.1 0.1 0.1 0.2 
6457 65.0 5. : 0.0 aa 0.0 
0.0 0.0 0.0 0.0 


99.9 100.0 0. j 0.1 O01 0.1 : ; 
0.1 0.0 0.0 0.0 00 999 00 00 00 
0.0 998 00 00 00 998 00 999 998 99.8 
0.0 0.1 0.0 00 '999 00 0.1 


Ic| 00 0.0 01 M806! 0.0 M769 00 00 00 00 (eo6 
|G|/998' 00 00 00 00 00 00 00 00 0.0 0.0 
[T| 01 00 00 '493° 00 229) 00 00 O00 O01 O03 0.0 


0.1 39.9 : 99.9 99.5 
| 0.0 0.0 (96 6 00 00 0.0 
0 00 00 00 0.0 00 00 998 0.0 
7.7 eee SCoo.o 0.0 0.0 3.3 999 01 0.0 


100.C : 0.0 
0.0 | 60.1 0.0 97.C 99.4 
0.0 O00 40.0 eral 0.0 eal 0.0 
0.0 | 3918 100.0 0.1 29 00 O01 405 


ie 0.0 100.C 0.1 O00 O01 0.1 
00 O00 00 O00 00 00 00 00 00 00 00 0.0 
0.0 99.7 0.0 998 998 0.1 998 998 99.8 99.7 
00.0 0.0 0.1 0.0 0.0 00 999 00 01 O01 #00 00 


RNF2 
1.85 uM BE1 99.9 0.0 99.9 99 ¢ 
|C| 0.0 0.0 )594) 0.0 oe 57.8 ie 0.0 01 00 00 864 0.0 0.0 0.1 
|G|/999 01 00 0.0 0.0 00 00 998 00 00 00 00 00 0. 99.8 
|[T| 0.0 999 '408 0.0 isooral 42.1 GEE GEE 0.0 01 999 134 0.0 0.0 00 00 (99.9% 01 00 01 00 
Cc 
40% @ BE? in vitro 
a) 
SE @ untreated HEK293T cells 
® 
“- O . 
Pe 30% @ BE1 in HEK293T cells 
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ie) 
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Extended Data Figure 4 | BE1 base editing efficiencies are strikingly if the in vitro conversion efficiency is >10%. Note that maximum yields 
decreased in mammalian cells. a, Protospacer (black and red) and PAM are 50% of total DNA sequencing reads since the non-targeted strand 
(blue) sequences of the six mammalian cell genomic loci targeted by is unaffected by BE1. Values are shown from a single experiment. 
base editors. Target Cs are indicated in red with subscripted numbers c, HEK293T cells were transfected with plasmids expressing BE1 and an 
corresponding to their positions within the protospacer. b, Synthetic appropriate sgRNA. Three days after transfection, genomic DNA was 
80-mers with sequences matching six different genomic sites were extracted and analysed by high-throughput sequencing at the six loci. 
incubated with BE1 then analysed for base editing by high-throughput Cellular C to T conversion percentages, defined as the percentage of 
sequencing. For each site, the sequence of the protospacer is indicated total DNA sequencing reads with Ts at the target positions indicated, are 
to the right of the name of the site, with the PAM highlighted in blue. shown for BE] at all six genomic loci. Values and error bars of all data 
Underneath each sequence are the percentages of total DNA sequencing from HEK293T cells reflect the mean and standard deviation of three 
reads with the corresponding base. We considered a target C as ‘editable’ independent biological replicates performed on different days. 
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Extended Data Figure 5 | Base editing efficiencies of BE2 in U20OS DNA was extracted and analysed for base editing at the six genomic loci 
and HEK293T cells. Cellular C to T conversion percentages by BE2 are by HTS. Values and error bars reflect the mean and standard deviation of 
shown for each of the six targeted genomic loci in HEK293T cells and two (U2OS) or three (HEK293T) biological experiments performed on 


U20S cells. HEK293T cells were transfected using Lipofectamine 2000, and different days. 
U20S cells were nucleofected. Three days after plasmid delivery, genomic 
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Guntreated pre-passaging 
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Extended Data Figure 6 | Base editing persists over multiple cell One half was subjected to high-throughput sequencing analysis, and the 
divisions. Cellular C to T conversion percentages by BE2 and BE3 are other half was allowed to propagate for approximately five cell divisions, 
shown for HEK293 sites 3 and 4 in HEK293T cells before and after then harvested and subjected to high-throughput sequencing analysis. 
passaging the cells. HEK293T cells were nucleofected with plasmids Values and error bars reflect the mean and standard deviation of two 
expressing BE2 or BE3 and an sgRNA targeting HEK293 site 3 or 4. biological experiments performed on different days. 

Three days after nucleofection, the cells were harvested and split in half. 
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b 
non-protospacerC/Gs___ average C/G (%)__ average T/A(%)__ lowest T/A(%)__ highest T/A (% 
untreated 99.95 + 0.14 0.02 + 0.02 0.00 2.44 
BE1 99.95 + 0.24 0.03 + 0.03 0.00 1.64 
BE2 99.95 +0.13 0.03 + 0.03 0.00 1.02 
BE3 99.97 + 0.09 0.02 + 0.02 0.00 2.52 
Extended Data Figure 7 | Non-target C/G mutation rates. Shown against their positions relative to a protospacer for all 3,200 cytosines/ 
here are the C to T and G to A mutation rates at 3,200 distinct cytosines guanines. The side of the protospacer distal to the PAM is designated 
and guanines surrounding the six on-target and 34 off-target loci with positive numbers, while the side that includes the PAM is designated 
tested, representing a total of 14,700,000 sequence reads derived from with negative numbers. b, Average non-target cellular C to T and Gto A 
approximately 1.8 x 10° cells. a, Cellular non-target C to T and Gto A conversion percentages by BE1, BE2, and BE3 are shown, as well as the 
conversion percentages by BE1, BE2, and BE3 are plotted individually highest and lowest individual conversion percentages. 
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a 
Untreated Lys | Arg Leu Ala Val Tyr Gin indel % 
APOE4 C158R G A A G | c | G C é | tT | ¢ G c | A G T G T | Alc cl] al Gli ec 0.0 
A 0.0 00 O00 O00 00 00 00 00 00 £400 00 OO O00 00 4000 00 00 1000 00 00 
Cc 0.0 0.0 0.0 0.0 |100.0 0.0 0.0 0.0 0.1 (100.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 
G ‘1400.0 0.0 0.0 999 00 1000 0.0 0.0 0.0 100.0' 99.9 0.0 0.0 100.0 0.0 1000 00 0.0 0.0 0.0 0.0 
T 0.0 0.0 0.0 0.1 0.0 0.0 0.0 0.0 100.0 0.0 0.0 0.0 0.0 0.0 100.0 00 1000 0.0 0.0 0.0 0.0 0.1 0.0 
BE3 + on-target sgRNA Lys | Arg > Cys | Leu > Leu Ala | Val Tyr Gin | indel % 
APOE4 C158R G A A G | Cs G ej/¢e¢ftT]s G | Cc A G T | G T A Cc Cc A G G 6.1 
A 0.1 0.0 10 0.0 16 O6 00 00 O00 £00 00 O00 00 £00 0.0 = @©60.0 0.00 8©=— 0. 
Cc 0.0 0.0 0.0 0.0 391 00 620 613 0.0 0.0 0.0 999 0.0 0.0 0.0 0.0 0.0 0.0 (100.0 100.0 0.0 0.0 0.0 
G (999 0.0 0.0 1000 15 999 07 0.5 0.0 100.0 100.0 0.0 0.0 100.0 00 1000 0.0 0.0 0.0 0.0 0.0 (100.0 99.9 
T 0.0 O00 00 00 583 O01 357 375 1000 00 00 101 0.0 00 (100.0 00 1000 0.0 0.0 00 £00 0.0 0.0 
BES + off-target sgRNA Lys Arg Leu Ala Val Tyr Gin indel % 
APOE4 C158R G A A G Cs G | c c | tT |G G ¢ A G i G T A e fe A G G 0.0 
A 0.0 00 00 O00 O00 00 00 00 00 400 00 O00 O00 400 0.0 @0.0 0.0 860.0 
c 0.0 0.0 0.0 0.0 |999 0.0 100011000 0.0 0.0 0.0 (100.0 0.0 0.0 0.0 0.0 0.0 0.0 | 100.0 100.0 0.0 0.0 0.0 
G ‘400.0 0.0 0.0 100.0 0.0 1000 0.0 0.0 0.0 (100.0 100.0 0.0 0.0 100.0 00 1000 0.0 0.0 0.0 0.0 0.0 | 100.0 100.0. 
T 0.0 0.0 0.0 0.0 0.1 0.0 0.0 0.0 100.0 0.0 0.0 0.0 0.0 0.0 (100.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 
Cas9 + HDR Lys Arg > Cys Leu Ala Val Tyr Gin indel % 
APOE4 C158R G A A G Cs i G = c | tT |G G Cc A G T G T A e Cc A G G_ | 40.1 
A 0.0 00 O00 O00 O04 00 O00 O00 00 £00 0.0 O00 00 £00 0.1 0.5 0.0 @ 0.0 
Cc 0.0 0.3 0.2 0.0 998 00 998 999 0.1 0.1 02 999 18 0.4 0.2 0.3 0.2 05 997 996 07 0.0 0.0 
G '99.7 0.0 0.0 999 00 997 0.1 0.0 0.1 0.0 0.1 986 14 994 02 0.3 0.1 0.0 0.0 
b T 0.2 0.0 0.1 0.1 0.1 0.3 0.1 0.1 998 0.2 0.1 0.1 0.1 1.1 FB o4 BS 02 0.2 0.0 0.5 0.6 0.0 
Untreated Arg Ala Met Ala lle Cys Lys indel % 
TP53 Y163C Cc c ie c G e Cc A T | G G e | & A T C T Ge Cc A A G Cc 0.0 
A 00 00 O00 00 00 00 400 0.0 OO 01 0.0 0.0 00 00 00 00 400 0.0 0.0 
Cc ‘400.0 100.0 0.0 (100.0 0.0 ‘1400.0 0.0 0.0 0.0 0.0 | 100.0 100.0 0.0 0.0 100.0 0.0 0.0 (100.0 0.0 0.0 0.0 
G 0.0 0.0 0.0 0.0 0.0 0.0 0.0 (100.0 999 0.0 0.0 0.0 0.0 0.0 0.0 | 999 0.0 0.0 0.0 (100.0 0.0 
T 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 (100.0 0.0 0.0 0.0 0.0 0.0 100.0 00 1000 0.0 0.0 0.0 0.0 0.0 0.0 
BE3 + on-target sgRNA Arg | Ala | Met Ala lle | Cys>Tyr | Lys indel % 
TP53 Y163C a = as i| ¢i|sefihe] et A T | G G [cle A T | @ | fT = Cc A A G Cc 0.0 
A 00 OO O00 O00 00 O00 00 0.0 00 0.1 0.0 ©3600 00 OO O00 33 £4205 0.0 ©3600 
C 0.0 0.0 100.0 1000 00 00 00 00 |1000 1000 00 00 1000 00 00 995 00 00 400 
G 0.0 0.0 1000 00 999 0.0 0.0 0.0 0.0 100.0 999 0.0 0.0 0.0 0.0 0.0 0.0 | 962 0.0 0.0 0.0 100.0 0.0 
T 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 |100.0 0.0 0.0 0.0 0.0 0.0 1000 00 1000 05 0.0 0.0 0.0 0.0 0.0 
BE3 + off-target sgRNA Arg Ala Met Ala lle Cys Lys indel % 
TP53 Y163C C Cc G c G Cc Gc A T G G Cc Cc A T e I Gs c A A G Cc 0.0 
A 00 00 O00 O00 O00 00 400 0.0 OO 01 0.0 0.0 00 O00 O00 00 400 0.0 0.0 
fe: ‘400.0 1000 00 |999 00 100.0 1000 0.0 0.0 0.0 0.0 1000 999 0.0 0.0 100.0 0.0 0.0 (100.0 0.0 0.0 0.0 | 100.0 
G 0.0 0.0 1000 00 999 0.0 0.0 0.0 0.0 (100.0 999 0.0 0.0 0.0 0.0 0.0 0.0 (100.0 0.0 0.0 0.0 (100.0 0.0 
T 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.1 0.0 100.0 0.0 1000 0.0 0.0 0.0 0.0 0.0 0.0 
Cas9 + HDR Arg Ala Met Ala lle Cys > Tyr Lys indel % 
TP53 Y163C Cc S G C G c Cc A T G G e | ¢ A T c t Ge C A A G c 8.0 
A 00 O00 O00 O00 00 O00 00 0.0 00 01 0.0 ©3600 00 O00 O00 00 £00 0.0 60604 
Cc '100.0 100.0 0.0 (1000 00 100.0 1000 0.0 0.0 0.0 0.0 | 100.0 100.0 0.0 0.0 100.0 0.0 0.0 100.0 0.0 0.0 0.0 996 
G 0.0 0.0 1000 00 999 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 (100.0 0.0 
T 0.0 0.0 0.0 0.0 0.1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 100.0 00 1000 0.0 0.0 0.0 0.0 0.0 0.0 


Extended Data Figure 8 | Additional data sets of BE3-mediated 
correction of two disease-relevant mutations in mammalian cells. 

For each site, the sequence of the protospacer is indicated to the right of 
the name of the mutation, with the PAM highlighted in blue and the base 
responsible for the mutation indicated in red bold with a subscripted 
number corresponding to its position within the protospacer. The amino 
acid sequence above each disease-associated allele is shown, together 
with the corrected amino acid sequence following base editing in green. 
Underneath each sequence are the percentages of total sequencing reads 
with the corresponding base. Cells were nucleofected with plasmids 
encoding BE3 and an appropriate sgRNA. Two days after nucleofection, 
genomic DNA was extracted from the nucleofected cells and analysed by 


high-throughput sequencing to assess pathogenic mutation correction. 

a, The Alzheimer’s disease-associated APOE4 allele is converted to 
APOE3r in mouse astrocytes by BE3 in 58.3% of total reads only when 
treated with the correct sgRNA. Two nearby Cs are also converted to Ts, 
but with no change to the predicted sequence of the resulting protein. 
Identical treatment of these cells with wild-type Cas9 and a 200-nt ssDNA 
donor results in 0.2% correction, with 26.7% indel formation. b, The 
cancer-associated p53 Y163C mutation is corrected by BE3 in 3.3% of 
nucleofected human breast cancer cells only when treated with the correct 
sgRNA. Identical treatment of these cells with wild-type Cas9 and donor 
ssDNA results in no detectable mutation correction with 8.0% indel 
formation. 
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Extended Data Figure 9 | Genetic variants from ClinVar that, in current base editing technologies. The results were filtered by imposing 
principle, can be corrected by base editing. The NCBI ClinVar database the successive restrictions listed on the left. The x axis shows the number 
of human genetic variations and their corresponding phenotypes (see of occurrences satisfying that restriction and all above restrictions on a 


main text ref. 4) was searched for genetic diseases that can be corrected by _ logarithmic scale. 
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Extended Data Table 1 | Indel formation following treatment of HEK293T cells with BE1, BE2, BE3, or wild-type Cas9 plus a ssDNA 
template for HDR 


EMX1 FANCF HEK293 site 2 HEK293 site 3 HEK293 site 4 RNF2 
indel (% indel (% indel (% indel (% indel (% indel (% 
untreated 0.01+0.00 0.00+0.00 0.00+0.00 0.00+0.00 0.00+0.00 0.00+0.00 
BE1 0.04+0.05 0.1140.13 0.02+0.04 0.00+0.00 0.00+0.00 0.00+0.00 
BE2 0.01+0.00 0.01+0.01 0.09+0.09 0.00+0.00 0.00+0.00 0.00+0.00 
BE3 1.3440.35 1.47+0.93 06240,35 0.914+1.07 0.95+1.64 1.39+0.72 
Cas9 + HDR 2.38+0.89 3.2640.22 7.14+40.96 


Indel frequencies were calculated as described in the Methods following treatment of HEK293T cells with BE1, BE2, and BE3 for all six genomic loci, or with wild-type Cas9 and a ssDNA template for 
HDR at three of the six sites (EMX1, HEK293 site 3, and HEK293 site 4). Values reflect the mean and standard deviation of at least three independent biological replicates performed on different days. 
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Structure of the thermally stable Zika virus 


Victor A. Kostyuchenko!*, Elisa X. Y. Lim! ?*, Shuijun Zhang!*, Guntur Fibriansah!’, Thiam-Seng Ng"?, Justin S. G. Ooi!?, 


Jian Shi? & Shee-Mei Lok!? 


Zika virus (ZIKV), formerly a neglected pathogen, has recently been 
associated with microcephaly in fetuses’, and with Guillian—Barré 
syndrome in adults. Here we present the 3.7 A resolution cryo- 
electron microscopy structure of ZIKV, and show that the overall 
architecture of the virus is similar to that of other flaviviruses. 
Sequence and structural comparisons of the ZIKV envelope (E) 
protein with other flaviviruses show that parts of the E protein 
closely resemble the neurovirulent West Nile and Japanese 
encephalitis viruses, while others are similar to dengue virus 
(DENV). However, the contribution of the E protein to flavivirus 
pathobiology is currently not understood. The virus particle was 
observed to be structurally stable even when incubated at 40°C, 
in sharp contrast to the less thermally stable DENV’. This is also 
reflected in the infectivity of ZIKV compared to DENV serotypes 
2 and 4 (DENV2 and DENV4) at different temperatures. The cryo- 
electron microscopy structure shows a virus with a more compact 
surface. This structural stability of the virus may help it to survive 
in the harsh conditions of semen‘, saliva’ and urine®. Antibodies or 
drugs that destabilize the structure may help to reduce the disease 
outcome or limit the spread of the virus. 

Zika virus (ZIKV), a flavivirus, is thought to be principally transmit- 
ted to humans by the mosquito (Aedes aegypti) vector. Other flaviviri- 
uses include West Nile virus (WNV), Japanese encephalitis virus (JEV), 
dengue virus (DENV) and yellow fever virus (YFV). ZIKV generally 
causes a mild disease. However, when pregnant women are infected 
with ZIKV, there is an increased risk of developing microcephaly in 
the fetus’. 

Retrospective analysis of data collected from a ZIKV outbreak in 
French Polynesia in 2013-2014 showed association of the virus with 
microcephaly’. Here we present the 3.7 A resolution structure of ZIKV 
strain H/PF/2013 isolated during that outbreak’. 

For cryo-electron microscopy (cryoEM) studies, ZIKV was grown 
in the mosquito cell line at 28°C and purified at 4°C by polyethylene 
glycol precipitation, a sucrose cushion, followed by a potassium tartrate 
gradient. The gel analysis of the purified sample suggested it contained 
mostly mature virus (Extended Data Fig. 1). The ZIKV samples were 
incubated at 28 °C, 37°C and 40°C (mimicking high fever) for 30 min, 
before imaging by cryoEM (Fig. 1a). At 28 °C, there were broken and 
shrivelled particles together with some smooth surfaced particles 
(about 500 A in diameter), similar to the compact DENV mature par- 
ticles. Conversely, samples incubated at 37°C and 40°C showed many 
more smooth surfaced particles. The presence of a larger fraction of 
shrivelled particles at 28°C could be due to the exposure of particles to 
high osmolality during purification. We speculate that ZIKV particles 
may expand into smooth surfaced particles when incubated at higher 
temperatures, making the lipid envelope more fluid, and allowing the 
structure to revert to its normal state. Some strains of DENV2 (New 
Guinea C, NGC) undergo notable structural changes from smooth to 
bumpy surfaced particles when incubated at 37 °C (refs 9, 10), whereas 
others (PVP94/07)"' do not. Structurally, DENV4 is considered more 
stable than DENV2 (NGC strain), as it remains as a smooth surfaced 


particle even at 40°C, although considerable aggregation of particles 
occurs’. ZIKV seemed to be even more stable than DENV4, as the 
smooth surfaced particles did not aggregate at 40°C (Fig. la). This 
is consistent with the plaque assay results with ZIKV, DENV2 strains 
(NGC and PVP94/07) and DENV4 that were pre-incubated at 28 °C, 
37°C and 40°C for 30 min. ZIKV retained its infectivity, whereas the 
infectivity of DENV2 strains and DENV4 is significantly reduced 
at higher temperatures (Fig. 1b). The infectivity of ZIKV was only 
slightly reduced when the pre-incubation time was increased to 60 min 
(Fig. 1c). 

The cryoEM structure was reconstructed from the virus sample incu- 
bated at 40°C for 30 min. The 3.7 A resolution cryoEM map (Fig. 2a and 
Extended Data Fig. 2) showed well-resolved densities corresponding 
to the a-helices and B-strands. In addition, many side-chain densi- 
ties were visible (Fig. 2b). The overall structural architecture of ZIKV 
(Fig. 2c) is similar to that of DENV (refs 3, 12-14) and WNV (ref. 15). 
While our paper was under review, a similar ZIKV structure was pub- 
lished'® showing similar features. There are 180 copies of each of the 
E and associated M (‘membrane’) proteins on the virus surface; they 
are anchored in the bilayer lipid membrane via their transmembrane 
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Figure 1 | ZIKV particles are more stable than DENV. a, CryoEM 
micrographs of ZIKV particles incubated at different temperatures. 

b, ZIKV infection in BHK-21 cells is not affected when the virus is 
pre-incubated for 30 min at different temperatures. By contrast, all 
DENV strains have reduced infectivity with increasing temperatures. The 
y axis shows the percentage of plaque-forming units (PFU) at different 
temperatures divided by that of the same virus at 4 °C. c, Increased 
incubation time slightly reduced ZIKV infectivity. Blue and green bars 
indicate 30- and 60-min incubation times, respectively. Two experiments 
were done, and each condition was tested in triplicate. Data are mean 
and s.d.; means were calculated by averaging the triplicates within an 
experiment, and the s.d. shows the variability between the means from two 
separate experiments. 
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Figure 2 | The 3.7 A resolution cryoEM structure of ZIKV. a, The ZIKV 
cryoEM map. The map is coloured radially: 0-219 A (cyan), 220-240 A 
(purple) and >240 A (gold). b, Zoomed-in view of the density map (cyan 
mesh) fitted with the E protein structure. c, The E protein arrangement 


regions. The E protein is the major protein involved in receptor binding 
and fusion. The M protein is a small protein that is hidden under the 
E protein layer. The E and M proteins are organized in icosahedral 
symmetry consisting of 60 repeating units, and each asymmetric unit 
contains three individual E proteins (molecules A, B and C) (Fig. 2c). 
The E proteins are arranged as dimers, with three dimers lying parallel 
to each other forming a raft; there are 30 such rafts covering the viral 
surface. The E ectodomain contains three domains: DI, DII and DIII 
(Fig. 2c, right). DI acts as a bridge between DII and DIII. The tip of DII 
contains the fusion loop, which interacts with the endosomal mem- 
brane during fusion. The DI-DII hinge is thought to be important for 
flipping out DII to expose the fusion loop during the fusion event. The 
DI-DIII hinge is thought to be rigid at neutral pH. In most other flavi- 
viruses, DIII contains the receptor-binding site’” and has an important 
role in fusion’®. After the carboxy-terminal end of the E ectodomain 
is the a-helical stem region that lies on the viral membrane, and the 
o-helical transmembrane region. 

The superposition of the three individual E protein molecules (A, 
B and C) in the asymmetric unit of ZIKV showed high similarity 
(Extended Data Fig. 3a), typical for flaviviruses (Extended Data Fig. 3b). 
Superposition of DI of molecule A of the ZIKV E protein with that of 
DENV?2 (ref. 14) showed a slight difference in the angle at the DI-DIII 
and DI-DII hinges (Extended Data Fig. 3c). 

Sequence comparison shows that the ZIKV E protein has similar 
sequence homology levels (ranging from 40 to 58%) to other flavivi- 
ruses (Extended Data Fig. 4a). There are some insertions, deletions 
and mutations on the E proteins that are distinct among the neurovir- 
ulent viruses (WNV, JEV and tick-borne encephalitis virus (TBEV)) 
and the febrile-illness-causing viruses (YFV and DENV) (Extended 
Data Fig. 4b). However, their contributions to the resultant pathology 
are not clear. A superposition of our ZIKV cryoEM structure onto the 
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on the ZIKV surface. The Asn154 glycosylation sites are shown as green 
spheres. The three individual E proteins (A, B and C) in one asymmetric 
unit (black triangle) are indicated by transparent surfaces. 


DENV?2 cryoEM" structure, and the crystal structures of WNV”’ and 
JEV”’ E proteins show structural similarities (Extended Data Fig. 5), 
which are consistent with the sequence comparisons (Extended Data 
Fig. 4b). For example, ZIKV is similar to DENV at the hi-loop of DII 
(Extended Data Figs 4b and 5a), where both have a three-residue long 
insertion compared to WNV, TBEV and JEV. However, at other sites, 
ZIKV is more similar to the neurovirulent JEV and WNV (Extended 
Data Figs 4b and 5b). For example, sequence comparison shows an 
Asn67Asp mutation in JEV, WNV and ZIKV compared to DENV, indi- 
cating that the glycosylation site is absent (Fig. 2c). Therefore, there is 
only one Asn154 glycosylation site in an E protein in ZIKV, whereas 
DENV has two (Asn67 and Asn153)'*. The Asn154 glycosylation site 
is suggested to be a neurovirulence determinant in WNV7'; however, 
DENV also has this glycosylation site but infection with DENV rarely 
results in neurovirulence. The Asn67 (but not Asn153) glycosylation 
site in DENV has been shown to be crucial for binding to DC-SIGN 
(also known as CD209)*” on dendritic cells. ZIKV, which also interacts 
with DC-SIGN”’, does not have this glycosylation site, and therefore 
could have a different binding mechanism. 

A longer glycan loop at DI is observed in ZIKV, WNV and JEV com- 
pared with TBEV, YFV and DENV 1-4 (Extended Data Figs 4b and 5b). 
An extra residue is present in the DII kl-loop of WNV, TBEV, YFV and 
ZIKV compared to DENV1-4. A residue insertion at the DII C strand 
in WNV, JEV, TBEV and ZIKV was observed as opposed to DENV 1-4. 
Finally, there is a two-residue insertion at the DII DE-loop in WNV, 
JEV and ZIKV compared to TBEV, YFV and DENV1-4 (Extended 
Data Figs 4b and 5b). 

Notably, our ZIKV structure shows that the one-residue insertion at 
the end of the C strand of DIT (Extended Data Fig. 4b) compared to 
DENV may increase the overall stability of the virus. For comparison, a 
similar resolution DENV2 cryoEM structure", which has been shown 
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Figure 3 | Possible interactions between the ZIKV DIII CD-loops 
around the five-fold vertex may cause differences in the position of 
A-C’ dimer compared to DENV2. a, The CD-loops of ZIKV (green) 

are stretched closer towards the five-fold vertex than those in DENV2 
(yellow). In ZIKV, the Gln350 residue may form a hydrogen bond (dashed 
line) to the Thr351 from the CD-loop of a neighbouring DIII, creating a 
hydrogen bond network around the five-fold vertex. The DENV2 residues 
in the CD-loop (yellow), in comparison, do not interact with other 
neighbouring CD-loops. b, Positional differences of ZIKV A-C’ dimer 
compared to DENV2. Top panel shows the side view. ZIKV and DENV 
molecule A are coloured green and yellow, respectively. Molecule C’ of 
both ZIKV and DENV2?2 is coloured grey. Bottom left panel show molecule 
A rotated by 180° to an axis perpendicular to the viral membrane. The 
ZIKV CD-loop (red arrow) is lifted upwards probably owing to the 
five-fold interaction as demonstrated in a; this molecule rotates slightly. 
Bottom right panel shows molecule C’ in the same orientation as in the 
top panel. The whole ZIKV E protein (green) molecule C’ is positioned at 
a lower radius than in DENV2 (yellow). The positions of the ZIKV stem 
(blue arrow) and transmembrane (TM) regions suggest that the membrane 
is pushed downwards compared to DENV2. 


to be less stable at increased temperatures”!, was used. The Ala340 
insertion in the C strand of DIII allows the CD-loop to stretch further 
towards the five-fold vertex compared to DENV2 (Fig. 3a). Gln350 of 
the CD-loop from one DIII could potentially form a hydrogen bond 
with Thr351 from another CD-loop in the neighbouring DIII, and thus 
may create a hydrogen bond network between the five DIIIs (Fig. 3a 
and Extended Data Fig. 6). In comparison, in DENV2, there is a lack 
of interaction between the DIIIs in this region (Fig. 3a). The possible 
hydrogen bond network around the five-fold vertex of ZIKV could be 
responsible for the rotation of the E protein molecule A slightly (6°) 
anticlockwise around the axis parallel to the viral membrane (Fig. 3b, 
bottom left). When compared to DENV2, the position of DIII of the 
ZIKV molecule A was at a slightly higher radius, whereas DI dips 
slightly closer to the viral membrane. The stem and transmembrane 
regions showed no notable positional differences in molecule A 
between DENV2 and ZIKV (Fig. 3b, top). The ectodomain of mole- 
cule C’, which forms a dimer with molecule A, is positioned at a lower 
radius than DENV2. The stem and transmembrane regions (Fig. 3b, 
bottom right), along with the membrane, are also pressed downward. 
The ectodomains of the B-B’ dimer also lie at a slightly lower radius 
(Extended Data Fig. 7b), although no positional difference in the stem 
and transmembrane regions was observed when compared to DENV2 
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(Extended Data Fig. 7a, bottom). These findings suggest that the ZIKV 
has a more compact surface than DENV2. 

The E proteins on the surface of some DENV?2 strains are shown to 
undergo structural rearrangements at 37°C (ref. 9). In this structure, 
the A-C’ dimer moves outwards to a higher radius and also rotates 
slightly. The B—B’ dimer moves to a higher radius and the E proteins 
dissociate from each other. A study of an antibody, which traps a tran- 
sitional stage of this motion'’, suggests that the A-C’ dimer proba- 
bly moves to a higher radius before the BB’ dimer'?. In the ZIKV 
structure, the possible network of interactions around the five-fold 
vertex between the five A-C’ dimers may result in tighter packing that 
prevents this structural transition. 

Both the TIM (T cell, immunoglobulin and mucin) and TAM (Tyro3, 
Axl and Mer) phosphatidylserine transmembrane receptors have been 
shown to be used by DENV for cell entry”*. Conversely, ZIKV was 
shown to bind to TAM but not TIM receptors”. It is possible that the 
looser surface of DENV2 compared to ZIKV could allow more inter- 
actions with a wider range of phosphatidylserine-binding receptors. 

Unlike DENV, ZIKV is detected in the urine’, saliva’ and semen* 
(which allows potential transmission by sexual contact”), suggesting 
that the virus is stable enough to withstand harsh conditions. This 
is consistent with our infection assay showing that ZIKV is more 
thermally stable than DENV. Furthermore, the potential interactions 
identified in our structure may explain the stability of ZIKV and may 
serve as potential sites to exploit for future therapeutics. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 
Virus plaque assay to determine the stability of ZIKV and DENV serotypes at 
different temperatures. The virus strains used for the plaque assay experiments 
were: DENV2 (NGC), DENV2 (PVP94/07), DENV4 (06K2270) and ZIKV 
(H/PF/2013). Virus samples were pre-incubated for 30 min at 4°C, 29°C, 37°C 
and 40°C and chilled at 4°C for at least 30 min. Samples were serially diluted and 
added to BHK-21 (ATCC) cells grown in 24-well plates. For plaque assays, the 
cells were not specially authenticated, and not tested specifically for mycoplasma 
contamination. Virus was incubated with the cells for 1h at 37°C, then layered 
with methyl cellulose. After incubation at 37°C for 4 days, the cells were fixed and 
stained to determine the plaque-forming units. 

A similar experiment was done by pre-incubating ZIKV H/PF/2013 for 60 min 
at the same temperatures, before adding to BHK-21 cells. 
Virus sample preparation. ZIKV strain H/PF/2013 was grown in Aedes albop- 
ictus C6/36 cells (ATCC) supplemented with RPMI1640 medium and 2% fetal 
bovine serum. The cells were inoculated with ZIKV at a multiplicity of infection 
of 2, and incubated at 29°C for 3 days. Virus-containing medium was subjected to 
centrifugation at 12,227¢ for 45 min to remove cells and other debris, and the virus 
was precipitated with 8% (w/v) polyethylene glycol 8000 in NTE buffer (10 mM 
Tris-HCl, pH 8.0, 120 mM NaCl and 1 mM EDTA) overnight. Virus was pelleted by 
centrifugation at 14,334g, resuspended in NTE buffer and purified through a 24% 
sucrose cushion. Virus was further purified by ultracentrifugation using a 10-30% 
(w/v) potassium tartrate gradient. Virus band was extracted from the gradient with 
a syringe, and then the sample was buffer-exchanged into NTE buffer and concen- 
trated using Amicon Ultra-4 100-kDa centrifugal concentrator (Millipore). Virus 
concentration and purity was estimated by Coomassie-blue-stained SDS-PAGE 
gel, where different concentrations of bovine serum albumin (BSA) solution were 
used as standards. All steps of the purification procedure were done at 4°C. 
CryoEM sample preparation. Virus samples were incubated at 4°C, 29°C, 37°C 
or 40°C, for 30 min, then kept at 4°C before freezing on cryoEM grids. About 2 l 
of sample was put on a copper cryo-EM grid (with lacey carbon covered with thin 
carbon film), blotted for 1s and plunged in liquid ethane bath using the Vitrobot 
plunger (FEI). The grids were stored in liquid nitrogen. 
CryoEM data collection, image processing and 3D reconstruction. Images 
of vitrified virus were collected on a Titan Krios (FEI) microscope at 300 kV in 
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nanoprobe mode using Leginon software for automated data collection. The mag- 
nification was 104,477, giving a pixel size of 1.34 A. The images were recorded 
on Falcon II direct electron detector (FEI) in movie mode, with 30 frames per 
1.8s exposure time, giving a total dose of 50 e~ A~*. The individual frames were 
mutually aligned to correct for beam-induced sample drift with the MotionCorr 
program”®. The first 10 frames were combined to produce the average image 
for further processing. The higher contrast full dose average was also calculated 
and used for defocus determination and for manual particle picking. In total, 
11,600 individual virus particles from 3,083 micrographs were selected. The 
3D reconstruction was carried out with MPSA”’, by using a 3D map of DENV4 
(EMD-2485) as the starting model, with icosahedral symmetry averaging. The 
image data was split into two halves and processed with MPSA independently, 
to follow the ‘gold standard’ procedure for resolution determination. After 
20 cycles of orientation search and 3D reconstruction, the final map was produced 
by using 7,180 particle images. The resolution of the map is 3.7 A, at Fourier shell 
correlation cut off of 0.143 (Extended Data Fig. 2). 

Model building and refinement. The resolution of the map allowed us to trace 
all of the polypeptide chains: three E and three M proteins. After manual struc- 
ture building in Coot*’, using DENV2 coordinates (PDB 3J27) as a starting 
template, modified to include insertions, the protein structure had been refined 
with CNS”? with icosahedral symmetry constraints applied, and showed Rwork 
and Rfee values of 37% and 39%, respectively. For refinement, the map has been 
‘sharpened’ with application of a B-factor of —55 and low-pass filtered to 3.7 A. 
Additional secondary structure restrains were applied to avoid overfitting. After 
refinement, the fit and the geometry of the protein structure was checked using 
Coot tools. 


26. Li, X. et al, Electron counting and beam-induced motion correction enable 
near-atomic-resolution single-particle cryo-EM. Nature Methods 10, 584-590 
(2013). 

27. Liu, X., Jiang, W., Jakana, J. & Chiu, W. Averaging tens to hundreds of 
icosahedral particle images to resolve protein secondary structure elements 
using a Multi-Path Simulated Annealing optimization algorithm. J. Struct. Biol. 
160, 11-27 (2007). 

28. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of 
Coot. Acta Crystallogr. D 66, 486-501 (2010). 

29. Bringer, A. T. et al. Crystallography & NMR system: A new software suite for 
macromolecular structure determination. Acta Crystallogr. D 54, 905-921 
(1998). 
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Extended Data Figure 1 | The Coomassie-blue-stained SDS-PAGE 
gel profile of ZIKV. The virus preparation showed clear protein bands 
corresponding to the E protein (50 kDa), capsid (C) protein (15 kDa), 
and M protein (10 kDa). The premembrane (prM) band (25 kDa) is not 
detected, indicating that there is little contamination by immature virus 


particles. 
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‘Gold standard’ FSC plot for Zika virus reconstruction 
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Extended Data Figure 2 | Gold standard Fourier shell correlation of the two cryoEM maps, made from odd and even numbered particles. At Fourier 
shell correlation (FSC) cut offs of 0.5 and 0.143, the resolution is 4.3 A and 3.7 A, respectively. 
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Extended Data Figure 3 | Comparison of ZIKV with DENV2 E proteins. 
a, Superposition of ZIKV E protein molecules A, B and C using DI as a 
reference point. The molecules are similar to each other. b, Superposition 
of the three individual DENV2 E proteins in an asymmetric unit by their 
DIs show high similarities between them. c, Superposition of molecule 

A of ZIKV with that of DENV2 by their DIs shows that they have slightly 
different DI-DIII and DI-DII hinge angles. 
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Extended Data Figure 4 | Sequence comparison of ZIKV to indicate the most different regions between the neurovirulent viruses and 
neurovirulent viruses (WNV, JEV and TBEV) and to febrile-illness- the febrile-illness-causing viruses. Contributions of these regions to the 
causing viruses (YFV and DENV1-4). a, b, Percentage sequence virus pathology are unknown. ZIKV contains regions that are distinct in 
similarity (a) and sequence alignment (b) of flaviviruses. Green boxes both virus groups. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


DE-loop 

aa ZIKV 
a= WNV 
= JEV 


glycan loop 


Extended Data Figure 5 | Structural comparison of different regions 

on the E protein sites unique to febrile-illness-causing virus and 
neurovirulant virus to ZIKV. a, Comparison of the hi-loop between ZIKV 
and DENV by superimposing their DIIs. b, Comparison of the glycan loop 
(DI), CD-loop (DIII) and DE-loop (DIII), between ZIKV, WNV and JEV 
by superimposing their DIIs. Comparison of the kl-loop (DI-DII hinge) 
by superimposing these viruses by DI. 
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Extended Data Figure 6 | The fitted CD-loop in the cryoEM density around the five-fold vertex. 


LETTER 


a top view 


Extended Data Figure 7 | The ectodomain of the ZIKV B-B’ dimer is 
positioned lower in the radius compared to DENV2. a, Top panel shows 
the top view of the B-B’ dimer of ZIKV (green) and DENV2 (yellow), 
while the bottom panel shows the side view. The position of stem and 
transmembrane regions in both ZIKV and DENV are similar. b, The ZIKV 
E ectodomain is positioned at a slightly lower radius than DENV2. Stick 
connecting similar atoms in ZIKV and DENV is shown to help with the 
comparison of the positions of the E ectodomains. 
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CAREERS 


SCIENCE TO MARKET A neuroscientist takes on 
deal-making and tech transfer p.434 


JOB SKILLS Teach yourself to love 
networking go.nature.com/cwocoj 


NATUREJOBS For the latest career 
listings and advice www.naturejobs.com 


Take time to make brief visits to faculty members, who can provide leads on grants, awards and jobs. 


RELATIONSHIPS 


Doctor’s advice 


US doctoral students may find that their dissertation 
committee can double up as a board of career mentors. 


BY ALAINA LEVINE 


a certain advantage over students else- 

where. They are often required to assem- 
ble a dissertation committee — mentors who 
advise on topic selection and research ideas, 
scrutinize data, edit drafts and help to direct the 
work through submission and defence. 

But members of a dissertation committee 
can often do a great deal more than that. They 
can lend emotional support or help to expand 
professional networks. They can offer leads 
and advice on grants, awards and jobs. 

To maximize this relationship, the canny 
PhD student will select committee members 


p:: candidates in the United States have 


with care and nurture the bond (see ‘How to get 
the most out of your dissertation committee’). 
Those on both sides of the experience advise 
staying in close and regular contact with men- 
tors from day one, then keeping them updated 
on your evolving interests and progress during 
the doctoral programme — and beyond. “Ifthe 
committee is chosen wisely, these people can be 
pillars of support for years to come,’ says Jelena 
Kovacevic, a biomedical engineer at Carnegie 
Mellon University in Pittsburgh, Pennsylvania. 

Steven Senger talks of his dissertation- 
committee members in glowing terms. Dur- 
ing his 2005-11 mathematics PhD, one found 
funding for him to attend several conferences 
and accompanied him to each, including a 
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prestigious annual maths summit that unites 
researchers from academia, industry and gov- 
ernment. Senger, who is now a mathematician 
at Missouri State University in Springfield, notes 
that his mentor “was careful to pick certain peo- 
ple whom he wanted me to meet and have lunch 
with” One such meeting spawned a collabora- 
tion with a prominent senior colleague. 

Another of Senger’s committee members 
explained how to cultivate collaborations and 
how best to conduct research in maths. “He 
walked me through the logical thought process,” 
Senger says. “He didn't just regurgitate a paper 
to me — he explained how he got to this conclu- 
sion. He treated me like an apprentice. There's 
a difference between giving someone the bare 
facts and training them to bea peer.” 


HONESTY APPRECIATED 

To glean the most from the relationship, it’s 
best to be honest about what you want to 
do with your career, even if it differs from a 
mentor’s own path — as maths PhD student 
Vyacheslav Kiria-Kaiserberg learned. In 2011, 
he asked mathematician Alex Iosevich of the 
University of Rochester in New York to be one 
of his committee members. The two subse- 
quently spent many hours talking informally, 
and Kiria-Kaiserberg was upfront about his 
hopes to work in industry. So Iosevich intro- 
duced Kiria-Kaiserberg to his brother Sam, a 
manager at a data-science company in Chi- 
cago, Illinois, over dinner at Iosevich’s home. 

The informal meeting was a success by all 
accounts: this summer, after Kiria- Kaiserberg 
defends his dissertation, he will start work as 
a consultant at Sam Iosevich’s company. “It’s 
very important to choose the right people,” 
says Kiria-Kaiserberg. “You can benefit from 
them and they can benefit from you.” 

Openness also helped medical researcher Luz 
Claudio in 1991 as she wrapped up her neuro- 
pathology PhD. She knew that she was more 
interested in applying her research to problems 
in the community than studying neurons at the 
bench, and had talked often of her interests with 
a committee member, a neuropathologist at her 
institution. He suggested a science-policy fel- 
lowship, and he wrote a recommendation that 
led to her winning one with the US Environ- 
mental Protection Agency (EPA). 

After the stint ended, Claudio knew that she 
wanted to work at the intersection of public- 
health research and policy. Today, as head of the 
division of international health at Mount Sinai 
School of Medicine in New York City, Claudio 
has won renown for her work onasthmain > 
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TRADE TALK 
Cell negotiator 


Ben Thiede 

is a business- 
development officer 
at STEMCELL 
Technologies in 
Vancouver, Canada. 
As a neuroscience 
graduate student, he 
worked out methods 
to coax stem cells to 
take on specialized properties of the inner ear. 
Now, he works out intellectual-property and 
other deals that help to bring technologies to 
the scientific market. 


When did you first think about jobs beyond 
the bench? 

About two years before I graduated from my 
PhD programme at the University of Virginia 
in Charlottesville, I got married and started 
thinking more long term about my career. 
Before that, science was just fun. 


What led you to your current career? 

A classmate who was a year ahead of me 
was graduating with her PhD and going to 
law school. She had been exposed to other 
careers through an internship for the univer- 
sity’s licensing and ventures group, working 
on technology transfer — helping to license 
and commercialize academic work. I did 
some informational interviews along the 
same lines, but I got an internship by talk- 
ing to a guy who was fitting me for a suit. He 
was the father-in-law of one of the licensing 
managers at the university's patent founda- 
tion. I e-mailed the manager, and he took 
me on. 


How did your internship compare to the lab? 
Asa grad student, you think narrowly about 
one problem. My learning curve in tech trans- 
fer was steeper than it was in the lab. I had to 
quickly pick up what was important about a 
technology: how does it fit into the field, and 
what is its value? It was really interesting for 
me to be close to science that was close to the 
point of being useful to people. 


What do you do now? 

Tam looking for technologies, and evaluating 
whether they work or not and what is already 
patented. After we negotiate a license to bring 
ina technology, I can see that in six months or 
a year we are going to be using it in a product 
that researchers can buy and use in their labs. = 


INTERVIEW BY MONYA BAKER 
This interview has been edited for length and clarity. 
See go.nature.com/xdttdg for more. 
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MENTOR MATCH 


How to get the most out of your dissertation committee 


e Diversify Populate your committee with 
people who are in your subfield and in 
related fields. Consider adding members 
who work at institutions other than your 
own, or outside academia. 

@ Communicate early and often Meet 
privately with your committee members 
so that you can develop a personal 
rapport. Check whether they prefer phone, 
text, e-mail or in-person contact. 

@ Be honest and clear about your goals 
and concerns Discuss your professional 
objectives, such as whether you want to 
work in academia or industry, move to a 
different city or nation or pursue a career 
in science, technology or elsewhere. The 


> low-income communities. She credits her 
relationship with the committee member: 
“That experience changed my life.” 

Although frequent face time is important, 
PhD students should choose at least one com- 
mittee member who is outside their own insti- 
tution, says environmental scientist Tammy 
Newcomer Johnson. That external connec- 
tion, she says, can help to maximize potential 
career gains. Her own adviser at the Univer- 
sity of Maryland in College Park encour- 
aged her to diversify her committee, and she 
invited two ‘outsiders, including Paul Mayer, 
an EPA ecologist whom she had met during 
fieldwork. They discussed career possibilities 
in the federal government, and Mayer noted 
links between Newcomer Johnson's doctoral 
research and the EPA headquarters in Wash- 
ington DC — where she is now a postdoc. 

Thanks to a recommendation from him, 
she was invited to present a seminar at an 
EPA lab on monitoring stream restoration. 
Her talk boosted her CV and led to new con- 
tacts in the agency, who in turn gave her more 
information about research posts there. Now 
she is mulling over staying with the agency 
for the long haul. “It gave me a sense of what 
a scientist does at the EPA — I realized that 
this could be a really wonderful career path,’ 
she says. 

Contact with committee members can 
stretch far beyond those PhD years, as can the 
benefits. Kovacevic has sat on about 30 dis- 
sertation committees; one of her former stu- 
dents turned to her for advice half a dozen 
years later while pursuing an assistant-dean 
position at a major US university. Kovacevic 
proofread the CV and suggested improve- 
ments. She has also nominated another for- 
mer student for membership ofa prestigious 
electrical-engineering committee. 

Indeed, there is much for a PhD stu- 
dent to gain from a warm association with 
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more they know about you, the more they 
can help you. 

@ Stay in contact Send e-mails, ask for 
coffee appointments, request short Skype 
chats and send holiday greetings. 

e@ Promote yourself Keep committee 
members abreast of your progress, 
successes, Challenges and aspirations 

so that they will keep you in mind for 
potentially game-changing career 
opportunities. 

@ Pay it back Look for ways to help them, 
too. You have a network and access to 
information, ideas and opportunities. Work 
out how you can assist them with their 
career and professional interests. A.L. 


committee members, but the best former 
students will also consider what they can give 
back. In January, Newcomer Johnson invited 
members of her former committee to give 
talks at the EPA. “I'm connecting them with 
my network,’ she says. A former student of 
Kovacevic’s landed a desirable position at a 
company and has since made sure to ask the 
professor to circulate the company’s job open- 
ings to her students. “This is a two-way ben- 
efit,” says Kovacevic. “She gets help to succeed 
at her job, and the benefit to me and my insti- 
tution is placing students at a good company.’ 

Students should remind faculty members 
whom they invite to serve on their dissertation 
committee that they are likely to benefit. “One 
measure of success for professors is the success 
of their students,” says Claudio. “When you are 
up for promotion and tenure, the better your 
students do, the better you look” 

In the end, seasoned committee members 
say, the rewards of a dissertation commit- 
tee can go both ways. “Understand this is an 
incredible potential resource,’ says Kovacevic. 
“Do as much as you can to nurture it.” m 


Alaina Levine is a freelance science writer 
based in Tucson, Arizona. 


CORRECTIONS 

The Careers Feature ‘Take my advice’ 
(Nature 532, 531-533; 2016) wrongly 
implied that the Houston-based consulting 
club is run by the Texas Medical Center. In 
fact, it is an independent organization. 
The Careers Feature ‘Change is in the air’ 
(Nature 532, 403-404; 2016) wrongly 
affiliated climate-change ecologist George 
Divoky with the University of Alaska 
Fairbanks. He is actually now director of 
Friends of Cooper Island in Alaska. 


mUeasm SCIENCE FICTION 


PROJECT EARTH IS LEAVING BETA 


Get ready for a brand new experience! 


BY J.W. ALDEN 


ear Backers, 
D The beta test of Project Earth 

is finally over. We can't thank 
you enough for your patience and 
generosity during our crowdfund- 
ing period. But before we roll out 
Earth 1.0, there's a few things you 
should know. 

First, the bad news: this update 
comes with a server wipe. Yes, we know 
you've poured time and effort into your 
‘lives’ on Earth, and it’s disappointing to 
lose your progress. Unfortunately, this can't 
be helped. We experimented with methods 
of porting existing avatars into the new ver- 
sion, but it brings a host of compatibility 
issues with our new character-creation sys- 
tem (more on that later). As a consolation, 
were planning another exciting in-game 
‘End of the World’ event, so you can go out 
with a bang. 

Here are some big changes to expect in 
the new update: 


NEW: Abuser Report System Every multi- 
player experience comes with griefers and 
trolls. Unfortunately, the nature of Earth's 
gameplay makes it very difficult to tell when 
someone is being an asshole on purpose, but 
we've implemented a system that lets you 
flag an avatar for review. This notifies our 
mod team, who will examine the player's 
behaviour for signs of intentional griefing. 
Please don't abuse this feature! Only report 
an avatar if it exhibits unreasonable hostil- 
ity that could not possibly be the result of 
normal ‘human’ development. 


NEW: Advanced Career Paths A limited 
number of political and business leadership 
careers will open to the general playerbase. 
Previous builds allowed only members with 
mod privilege to take on certain leadership 
positions in-game, which allowed us to shape 
the progress of ‘civilization’ as we saw fit. 
We've listened to your feedback on this policy, 
and were thrilled to open this aspect of the 
game to all players. We're also hoping this will 
address the ‘conspiracy theory phenomenon 
that became prevalent in previous builds. 


NEW: Space Exploration Many of you were 
quite vocal when we curtailed the space race 
during our ‘cold war’ in-game event. At 
the time, we worried about the playerbase 
advancing past the capabilities of our current 
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build, so we steered focus back towards Earth 
itself. Since then, we've been working hard on 
a new procedural engine for the cosmos, and 
now were ready to roll it out. We hope this 
will revitalize many career questlines, includ- 
ing the astronaut path. Oh, and dont think 
we havent heard your complaints about the 
light-speed barrier. These things take time. 


FIXED: Character Imbalance We turned 
a blind eye to this for a while, because we 
knew a fix would cause waves, no matter 
how we approached it. But after long delib- 
eration, we've given the character-creation 
suite an overhaul. A proprietary algorithm 
now determines your character's congenital 
traits, including things such as gender, race 
and sexual orientation. Any trait your avatar 
is ‘born’ with will be free of manipulation 
from players or mods. We know this might 
upset those of you who feel these traits are a 
matter of player choice, but we believe this 
is the more realistic approach. An increase 
in diversity will only enhance our experience 
on Earth. Plus, the inexplicable dominance 
of the ‘straight white male’ template kept 
crashing parts of the system. 


FIXED: Dreams We've finally isolated 
the cause of the ‘dream’ phenomenon, in 
which an avatar retains glimpses of the 
player’s real-world memories upon wak- 
ing from logout status. Some team mem- 
bers argued that we should make this bug 
a supported feature, owing to the impact of 
dreams on art and culture in Earth 0.91. In 
the end, we're not comfortable with the risk 

involved. Although it 


> NATURE.COM hasn't happened yet, 
Follow Futures: this dream glitch 
 @NatureFutures could lead to avatars 
Hi go.nature.com/mtoodm = realizing that their 
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‘lives’ are a game. We're not entirely sure 
what would happen if this occurred, 
but we're looking to avoid the sudden 
emergence ofa secret resistance group 

working towards exposing the simula- 

tion, so we decided to remove them. 


FIXED: History Leak There's been 

discussion on the forums about a 

‘history leak’ phenomenon, in 
which various myths, legends and 
religious tales seem to reflect previous 
builds of Earth. For instance, the Greek pan- 
theon resembles the superuser avatars that 
the development team used in our alpha test. 
Many flood myths bear a strong resemblance 
to the ‘End of the World’ event from our first 
playable build. While working on the dream 
bug, we discovered a crucial link between 
these two phenomena, which helped us to 
iron this one out. Sorry, but your old avatar 
wont be worshipped as a messiah in Earth 1.0. 


As this marks our official release, we antici- 
pate a large influx of new players. We've set 
up a special section of the forums for first- 
timers. Please consider paying it a visit and 
offering some advice to the noobs. Remem- 
ber how disorienting your first trip to Earth 
was (especially if you installed the ‘child- 
hood’ DLC expansion). 

The new ‘End of the World’ event is 
scheduled for 21 December. We expect a 
doomsday cult or two to arise before then, 
as some of you will unwittingly commu- 
nicate this date to your avatars in dreams. 
For this reason, we're keeping the nature 
of the apocalypse a secret until the end is 
upon you. Rest assured we've come up with 
something exciting and different this time 
around. No floods, plagues or supervolca- 
noes. We'll give you one hint: watch the sky. 

On behalf of everyone here at Blue 
Marble, wed like to thank our backers and 
beta testers. Our little world couldn't have 
made it this far without you, and we hope 
that you're all permanent residents. Earth 1.0 
is only the beginning! 

E. D. Amun 

Development lead, Project Earth 

Blue Marble Games, Subsidiary of 
Ogdoad Entertainment. = 


J. W. Alden is a graduate of Odyssey 
Writing Workshop and a Ist Place Writers 
of the Future winner. He lives in Florida 
with his wife, Allison. Find him at www. 
AuthorAlden.com. 
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obody likes to talk about their bowel movements. 

This squeamishness means that irritable bowel 

syndrome (IBS), which entails a change in stool form 
and frequency, is widely under-reported. The condition is 
comprised of several symptoms, such as diarrhoea or bloating, 
that are also present in other diseases, complicating efforts to 
study and treat it. IBS can also be associated with mental-health 
conditions such as anxiety or depression. In the past, this led 
some clinicians to categorize it as mainly a psychiatric concern, 
which for many years also attracted unwarranted stigma. This 
Outlook describes these difficult issues, the progress made and 
the challenges that remain (see page S118). 

IBS as we now knowit has a short history, but it is heavily 
influenced by work that goes back centuries (S102). Diagnostic 
capability took a leap forward in the 1980s with the Rome 
criteria — the fourth iteration is published this month 
(S107). But what many people want is a reliable diagnostic 
test (S110), which would also lead to better, more targeted 
drugs (S116). One subtype of IBS caused by gastroenteritis 
might already be treatable — or even preventable (S114). 

Food can trigger symptoms, and a diet low in FODMAPs 
— undigestible carbohydrates — has gained many supporters. 
But not all researchers are convinced of its effectiveness 
(S108). Something does happen in the large intestine, however, 
and researchers are trying to decipher the influence of the 
microbiome on the gut-brain axis (S104). Whatever the cause, 
one thing is clear — although IBS is mild for many people, 
for 1 in 20 patients it is a severe and even life-threatening 
condition. But help is available, and one clinician says that the 
first step is simply to listen to patients (S112). 

We are pleased to acknowledge the support of Allergan plc 
in producing this Outlook. As always, Nature retains sole 
responsibility for all editorial content. 


Michelle Grayson 
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MIND AND BODY 


< ¢ 460-322 Bc 
> 


Medical science can be traced back to 
Hippocrates in ancient Greece. He and his 
contemporaries — Plato and Aristotle — 
develop the concept of holism: the mind 
and body are one, and medicine should treat 
both. For example, Hippocrates not only 
recommends physical treatments such as 
enemas (pictured), but also soul-enriching 
activities such as listening to music. Dysen- 
tery, which is now recognized as a trigger 
of IBS, was then thought to result from an 
overabundance of blood. 


1637 


> 


French philosopher René Descartes proposes the theory of dualism: the spirit and body are distinct 
entities. Because the spirit is no longer thought to inhabit the body, autopsies become more acceptable. 
Physicians uncover the pathological roots of many conditions and begin to develop treatments. But 
superstition is not banished altogether. Conditions with no identifiable pathology, including mental- 
health disorders such as schizophrenia, are blamed on possession and, later, insanity. 


1800s 


> wi 


Several physicians publish accounts of bowel complaints with names such as membranous enteritis and 
colitis. “The bowels are at one time constipated, at another lax, in the same person. How the disease 

has two such different symptoms I do not profess to explain,’ writes William Cumming in the London 
Medical Gazette, adding, “The mental phenomena consequent on it amount, in some cases, to absolute 
insanity.’ Treatments included dietary changes, opiates, mustard poultices and electrotherapy. 


1945 


> 


The Second World War is over, but not the intestinal troubles of 
many soldiers and sailors who contracted dysentery in the trop- 
ics. Their symptoms continue or recur even after the causative 
microbes have gone. Writing in the British Medical Journal, G. T. 
Stewart suggests that long-term changes to intestinal flora might 
contribute to lingering diarrhoea. 


Drink only 
approved water 


ROAD TO ROME 


> wi 


Nazir Chaudhary and Sidney Truelove describe what they refer to as irritable colon syndrome. To their 
patients, the pair call the problem misbehaviour of the bowel and treat it with the sedative phenobarbital 
and the muscle relaxant propantheline bromide. Patients also report psychological symptoms such as 
anxiety or depression. Over the decade, the term irritable bowel syndrome becomes more popular. 
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Transit time 


THOTH’S BEAK 3500-2500 be 


The ancient Egyptian ibis-headed god 
Thoth (pictured) was associated with 
the enema — ibises were thought to self- 
administer using their long beaks. The 
Egyptians had specialized physicians to 
treat abdominal problems. 


BIOFEEDBACK (884 


American psychologist William James 
proposes a direct link between emotions 
and the body. For example, the theory 
suggests that the feeling perceived as 
anxiety results from the sensation of 
butterflies in the stomach. 


ABOMINABLE ABDOMINALS 1920s 


Some physicians apply derogatory terms 
to people who repeatedly seek help for 
digestive symptoms. In his lectures on 
dyspepsia, British physician Robert 
Hutchison complains of the “incessant” 
demands of the “abdominal woman”. 


MOULD BREAKING {928 


British biologist Alexander Fleming 
discovers a mould that kills bacteria, 
leading to the development of penicillin 
and other antimicrobials. These drugs will 
help to tame the cause of dysentery. 


PRESIDENTIAL PROBLEMS 196! 


During his time in office, then US 
president John F. Kennedy suffers from 
severe diarrhoea — as he did for much of 
his life. Doctors diagnose spastic colitis, 
which would now be called IBS. 


CLOCKWISE FROM TOP LEFT: JOSEPH KUHN REGNIER/ADOC-PHOTOS/CORBIS; ROGER WOOD/CORBIS; WORLD HISTORY ARCHIVE/TOPFOTO 


L: PHOTO ITAR-TASS/ YURY BELINSKY/ALAMY; R: JASON MERRITT/GETTY 


IRRITABLE BOWEL SYNDROME (ieltyy tere) 


The term irritable bowel syndrome (IBS) is relatively modern, but people have had similar 
symptoms for millennia. Initially thought to exist only in the mind, the disorder has gained 


legitimacy through the identification of causes and improved diagnosis. By Amber Dance. 


RELAX 1965 


The first drug marketed specifically for 
IBS in Europe is the anti-spasmodic 
mebeverine, which relaxes the gut 
muscles. 


TUMMY BUG 
1984 


Australian 
physician Barry 
Marshall drinks 
Helicobacter pylori 
to prove that the 
bacterium causes 
gastritis and ulcers, disproving the long- 
held view that these conditions have a 
purely psychological origin. 


STRENGTH IN NUMBERS 1987 


The patient organization the IBS Self 
Help and Support Group is formed in 
the United States. It now has around 
60,000 members. 


DUALISM DEBUNKED 1988 


The idea that IBS can result from 
interactions between the brain and the 
gut emerges from a survey reporting 
that certain psychological conditions 
typically make the syndrome worse, and 
lead people to seek medical attention 
(D. Drossman et al. Gastroenterology 95, 
701-708; 1988). 


HEART HURTS 2007 


Pharmaceutical company Novartis 
withdraws tegaserod, a drug for 
constipation-predominant IBS, after 
reports that it can cause severe cardiac 
problems in some people. 


IN THE GENES 2014 


About 2% of people with IBS carry 
mutations that cripple the SCN5A gene, 
which is known to encode a sodium 
channel that acts as a pacemaker for 
contractions in both the gastrointestinal 
tract and the heart (A. Beyder et a/. Gas- 
troenterology 146, 1659-1668; 2014). 


1987 


> 


Gastroenterologists gather in Rome to draft guidelines for the diagnosis of IBS, helping to 
legitimize the condition. The criteria generate some controversy: they do not perfectly distinguish 
IBS from other similar conditions, and they are difficult to validate. But use of these guidelines 
grows in clinics, research and drug trials, and grant agencies insist on their use (see page S107). 


2000 
> 
In February, the US Food and Drug Administration (FDA) approves the serotonin-receptor blocker 
alosetron for women with diarrhoea-predominant IBS. By November it has been withdrawn. Several 
patients have had complications due to constipation, such as a ruptured bowel, and some have died. 
Under public pressure, the FDA allows its use again in 2002, but only for women with severe diarrhoea. 


2004 
wa 
Researchers describe the first evidence of a specific molecular defect that may be a cause of 
IBS. The guts of people with IBS, as well as those with ulcerative colitis, contain relatively 
low amounts of both serotonin and an enzyme that synthesizes it, as well as a transporter 


that brings serotonin into cells (M. D. Coates et al. Gastroenterology 126, 1657-1664; 2004). 
Although this result is initially controversial, later studies confirm it. 


BACTERIAL BOUNTY 
2008 


~~ wa 


Evidence emerges that the low-FODMAP diet, which omits foods contain- 
ing short-chain carbohydrates, can improve symptoms in people with IBS 
(S. J. Shepherd et al. Clin. Gastroenterol. Hepatol. 6, 765-771; 2008). It is 
avery restrictive diet, but, because it seems to help up to two-thirds of 
people, it is increasingly adopted, often without medical advice (see 
page S108). 


2011 


wa 


Faecal analysis reveals that people with IBS have an altered gut microbi- 
ome, including more representatives of the phylum Firmicutes and fewer of 
Bacteroidetes (see page $104). This discovery opens up possibilities for diagnostic 
tests and treatments. Later, children with IBS who have certain microbes in their gut 
are found to be more likely to benefit from the low- FODMAP diet. 


2016 » )} 
> 


The stigma surrounding IBS continues to fade. During the Super Bowl, football fans are 
shown an advertisement for the drug rifaximin, featuring an animated intestine. Celebrities 
such as supermodel Tyra Banks (pictured) have previously spoken about their digestive 
symptoms. But despite the recent approval of several medications, scientists have much 
more to do to untangle the disparate causes of IBS. 
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MICROBIOME 


Bacterial broadband 


The involvement of intestinal bacteria in gut-brain communication could help to explain the 
mysteries of irritable bowel syndrome, but the search continues for definitive evidence. 


BY MICHAEL EISENSTEIN 


ost people prefer to imagine a 
M healthy distance between their brain 
and colon, but in fact the two are 
intimately linked. Neuronal lines of commu- 
nication help the brain to control the digestive 
process, and messages relayed back by the gut 
can influence both perception and behaviour. 
Evidence for another important player in 
this dialogue is coming to light. The human 
digestive tract is host to a massive and diverse 
community of bacteria that aid digestion and 
strengthen immunity — and over the past 
decade, scientists have observed that the gut 
microbiota also interact with the central nerv- 
ous system (CNS). Asa result, mental health 
and even neurological development might 
both shape and be shaped by the composition 
and behaviour of these bacteria. 

Symptoms as diverse as gastrointestinal 
distress, physical pain and depression may all 
converge on the health of our bacterial com- 
munities. This could be especially relevant in 
irritable bowel syndrome (IBS), in which phys- 
ical effects such as constipation and diarrhoea 
often co-exist with psychiatric problems such 
as anxiety and post-traumatic stress. Animal 
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experiments offer some evidence of bacterial 
involvement in the condition, whereas studies 
in people with IBS show that a dose of healthy 
probiotic bacteria could mitigate both diges- 
tive discomfort and psychiatric symptoms. 

These and other findings suggest that the 
microbiota act as crucial ‘first responders, 
relaying health-related signals between the 
digestive tract and the brain. Perturbations 
in this gut—brain—microbiome axis may play 
a central part in IBS, but much of the evidence 
for this is patchy or circumstantial. Many ques- 
tions remain about the extent to which this 
axis influences IBS pathology, about whether 
defects in microbiota are a cause or a conse- 
quence of the condition, and about how to turn 
this knowledge into treatments. 


MIND-BODY CONNECTION 
The existence of a brain-gut axis is not a 
particularly avant-garde idea. The roots of 
the concept go back to the late nineteenth 
century, when the physicians William James 
and Carl Lange first proposed that emotional 
response might be directly modulated by sig- 
nals transmitted from the viscera to the brain 
(see page S102). 

The intestinal wall maintains bidirectional 
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communication with the CNS through the 
vagus nerve, which runs from the brainstem 
to the gut and manages numerous unconscious 
tasks throughout the body, such as maintain- 
ing heart rate. The vagus nerve also relays 
essential stress-response signals. Its interaction 
with the gut regulates almost every aspect of 
the passage of digested material through the 
intestines — explaining why CNS disorders are 
so often accompanied by digestive problems. 

“If you have alterations in the brain, you will 
almost certainly have altered output to the gut 
because the two organs are that closely con- 
nected,” says Emeran Mayer, a gastroenterolo- 
gist at the David Geffen School of Medicine, 
University of California, Los Angeles. Con- 
versely, feedback signals from the gut to the 
brain can go well beyond obvious hungry- 
versus-full messages; abnormalities in the 
digestive system can directly shape both cog- 
nitive and emotional state. 

Research over the past few decades has 
clarified the role of this gut—brain axis in IBS. 
For example, brain-imaging studies by Mayer 
and colleagues have identified both structural 
and functional alterations in the CNS associ- 
ated with IBS, including notable changes in 
specific brain regions that respond to pain and 


SARAH J. COLEMAN 


discomfort’. Mayer's group has also identified 
top-down effects of psychosocial trauma and 
post-traumatic stress — especially from early 
in life. “In patients with IBS,” says Mayer, “up to 
60% indicate that stress is the factor that trig- 
gered either the first onset or the exacerbation 
of symptoms.” 

Furthermore, many people with IBS have 
associated psychiatric issues, and some find 
that mental-health-oriented treatments can 
improve their gastrointestinal symptoms. “Cog- 
nitive behavioural therapy, hypnotherapy and 
psychodynamic therapy are all quite effective,” 
says Magnus Simrén, a clinical gastroenterolo- 
gist at the University of Gothenburg in Sweden. 
And although trials of such interventions for 
IBS have proved challenging to perform and 
interpret, he notes that “they have still been 
quite impressive when it comes to their effec- 
tiveness in treating IBS”. 

Numerous lines of evidence also link IBS to 
perturbations of the gut microbiome. Many 
IBS cases arise after a gastrointestinal infection, 
which can markedly change the composition 
of the gut microbiota both directly, by killing 
off certain beneficial species, and indirectly, 
by damaging the intestinal environment that 
supports them (see page $114). Drugs can also 
influence the health of the microbiome. John 
Cryan and his colleagues at University College 
Cork in Ireland have shown that rats treated 
early in life with a cocktail of antibiotics that 
kills off certain types of gut microbe can expe- 
rience long-lasting gut hypersensitivity’ — a 
common symptom of IBS. 

However, the IBS microbiome data are still 
swimming into focus. Each gram of human 
colon tissue can contain up to one trillion 
bacteria, representing at least ten major cat- 
egories (or phyla) that comprise many poorly 
characterized subtypes. This means that most 
microbiotal data represent high-level surveys 
rather than detailed censuses. 

Microbiologist Willem de Vos of 
Wageningen University in the Netherlands 
was among the first to profile microbiome 
changes in large numbers of people with IBS’. 
His team’s surveys revealed some notable 
trends, including a reduction in the numbers 
of ‘good’ species, such as those that belong to 
Bifidobacterium, that typically form the bed- 
rock of a healthy microbiota, and an increase 
in Ruminococcus species that are associated 
with infection. “We've seen some patterns, 
and there's reason to believe that there are 
robust signatures of IBS,” he says. But the 
results so far are conflicting. Some research- 
ers have observed reduced microbial diversity 
in IBS, whereas others — including Mayer's 
group — have seen the opposite. 

Healthy microbiota are thought to collaborate 
with host epithelial cells to maintain the intes- 
tinal wall. Destabilization of this relationship 
could compromise the gut and impair diges- 
tion or excretion — defining symptoms of 
IBS. De Vos, together with Robin Spiller at the 


University of Nottingham, UK, showed that 
alterations to the microbiota in people who 
developed IBS after an infection were accom- 
panied by striking changes in intestinal gene 
expression’. Importantly, these correlated with 
alterations in specific subpopulations of gut 
bacteria. Moreover, these changes seem to be 
self-perpetuating in people with IBS. 

“Some people with Campylobacter 
infections get diarrhoea for a while and then 
go back to normal,” de Vos says. “But others 
don't go back and have recurrent problems.” 
In other words, he says, their microbiome falls 
into what is known as an “alternative stable 
state” that locks them into IBS. 


A BACTERIAL BRIDGE 

Only in the past few years have researchers 
come to suspect that all three elements — gut, 
microbiota and brain — intersect in IBS. Early 
evidence for a brain—microbiome connection 
came in 2011 from scientists at the Karolinska 
Institute in Stockholm. They demonstrated that 
‘germ-free’ rodents that have been reared under 
sterile conditions and lack a microbiota have 
a reduced anxiety response in various labora- 
tory tests relative to those with intact bacterial 
communities’. Other studies with germ-free 
animals have shown that disruptions in the 
microbiota can increase sensitivity to pain and 
bring on cognitive symptoms that resemble 
depression and anxiety. 

New evidence sheds light on the mechanism 
of the gut microbiota’s impact on nervous- 
system function. Researchers in Japan have 
found that germ-free mice produce lower than 
normal levels of dopamine and adrenaline’, 
two important molecules in the CNS response 

to stress. The microbiota 


“Theanimal also influence production 
data are of serotonin — an intrigu- 
almosthard __ ing finding given that this 
to believe neurotransmitter plays a 
because central part in gut motor 
they’re so function and digestion, as 


well as in various cognitive 
and mood disorders. 

Roughly 95% of the body’s serotonin is pro- 
duced in the digestive tract, and a team led by 
Elaine Hsiao, now at the David Geffen School 
of Medicine, has demonstrated in mice that a 
particular subset of gut bacteria directly stimu- 
lates intestinal serotonin synthesis and release’. 
“This is one of the most interesting papers in 
this field from the last ten years,” says Mayer. 
“Those serotonin-producing cells in the gut 
can be viewed as the main hub for commu- 
nication between the gut and brain, because 
they're exposed to what we eat and to signals 
produced by the gut microbes, and innervated 
by connections to the vagus nerve.” 

Several mouse studies suggest that the vagus 
nerve serves as some sort of ‘hotline’ by which 
gut microbes communicate directly with the 
CNS (see ‘A complicated conversation’). Cryan 
and his colleagues have found that the effects on 


dramatic.” 
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the CNS ofa probiotic bacterial strain seem to 
depend on signals relayed by the vagus nerve’. 
“That immediately tells us that vagal pathways 
may be relevant to IBS, and the microbiome 
could be shepherding that,’ Cryan says. A team 
led by Premysl Bercik, a gastroenterologist at 
McMaster University in Hamilton, Ontario, has 
likewise provided evidence that probiotics can 
treat anxiety-like symptoms in a mouse model 
of colitis — but only if this nerve remains intact 
and functional’. 

The brain can also affect the microbiota and 
potentially contribute to the onset or exacerba- 
tion of IBS. Cryan, Bercik and others have used 
mouse models to show that early-life stress 
can change the composition of gut microbial 
communities, although they are yet to work 
out how. “Many patients who are under stress 
will experience diarrhoea or a change in their 
bowel habits as a consequence, and this could 
potentially affect their microbiotal composi- 
tion,” says Bercik. 

Although IBS is not recognized as an 
inflammatory disorder per se, there is evi- 
dence that inflammatory processes might be 
hyper-responsive in people with IBS, which 
offers another route by which the CNS might 
modulate the microbiome. “Depression is well 
known to be a pro-inflammatory state,” says 
Eamonn Quigley, a clinical gastroenterologist 
at Houston Methodist Hospital in Texas. “We 
know the brain can influence the immune sys- 
tem in general, which can definitely affect the 
microbiome.” But he stops short of labelling 
this as a causal mechanism by which the CNS 
influences IBS symptoms. 


A UNIFYING THEORY 

Collectively, these data suggest that IBS could 
arise from disturbances of one or more physi- 
ological processes, which reinforce each other 
to exacerbate the disease state. “None of the 
previous theories were able to explain IBS 
fully,’ says Bercik. “The microbiota could be 
one of the unifying links.” The CNS response 
to early-life trauma could, for instance, trig- 
ger changes in the microbiota, which might 
then impede intestinal function and alter 
signalling to the brain, further amplifying the 
stress response. Or perhaps a gastrointestinal 
infection or inflammatory response to a food 
allergy could trigger a similar feedback cycle. 
“It’s kind of ridiculous to pin it down to just 
one of these factors because they all fit into this 
big jigsaw puzzle,’ says Mayer. 

This grand unifying theory of IBS is 
appealing, but is yet to garner strong experi- 
mental support. For one thing, links between 
the gut—brain-microbiome axis and IBS are 
built mainly on a foundation of animal stud- 
ies, which may poorly reflect the physiologi- 
cal disruptions of patients. The germ-free 
rodents that have produced such striking 
data, for example, have an extreme pheno- 
type that is never seen in humans. “We need 
to understand how this works with the milder 
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A COMPLICATED CONVERSATION 


The interplay between the digestive tract 
and the central nervous system is 
well-established. Signals produced by the 
gut microbiota are thought to influence 
this gut-brain axis. If they do, disruptions 
to this community of bacteria could have 
a role in exacerbating symptoms of 
irritable bowel syndrome. 


The microbiota is thought 

to influence mood and 
cognition, possibly by 
signalling to the brain 
through the vagus nerve 

or stimulating production 

‘ of neurotransmitters 
*e,, in the gut. But this has 
**,, been shown only in 
animal studies. 


Effects of environmen- The brain The gut 
tal stress on the brain regulates communicates 
can have a lasting digestive- with the brain 


impact on gut health. 


Disrupted gut 
function can 
change the 
microbiota’s 
composition or 
metabolic 
activity. 


Pathogenic bacteria or overexposure 


to antibiotics may kill off microbes 
that assist gut function. 


perturbations of the microbiota that we see in 
humans,’ says Simrén. 

Many studies of stress and its effects on IBS 
have relied on models in which a rodent pup is 
separated from its mother for hours at a time 
in the first few weeks after birth. The resultant 
stress effects can last the animal's lifetime, with 
symptoms that resemble human IBS. “They're 
hypersensitive, they have hyperactive motil- 
ity of the colon, stomach emptying slows and 
they're more anxious,’ says Mayer. But this 
maternal-separation model has not proved 
entirely persuasive. “It’s fantastic that they 
look like they have IBS, but it’s not the same 
thing,’ de Vos says. Indeed, given the apparent 
heterogeneity of IBS, many scientists agree that 
it seems far-fetched to imagine that any one 
animal model will ever tell the full story. 

As a clinician, Mayer recognizes the 
seriousness of this gulf. “The animal data are 
almost hard to believe because they’re so dra- 
matic,” he says, noting that, so far, no results 
have been faithfully replicated in a well-designed 
human study. One of the biggest problems is 
the paucity of human-microbiome data. Many 
studies of IBS have relied on faecal samples from 
small numbers of patients, who may have been 
inappropriately grouped together despite dis- 
playing different symptom patterns. “We need to 
have hundreds of patients in our studies to really 
see where there are differences between groups,’ 
says Simrén. Early studies also failed to account 
for gender differences or the confounding effects 
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tract activity. 


Intestinal wall . 


Microb 


through the 
vagus nerve. 


On bare 


Signals from the 
microbiota 
maintain the 
intestinal wall. 


- de 
Supplementation with ‘good’ 


bacteria, or probiotics, may 
restore the microbiota. 


of dietary habits, which can make a huge differ- 
ence to microbial composition. 

Another important consideration is how 
the microbiome changes over time. An initial 
bacterial community is established at birth, 
but develops as a person matures. Given the 
apparent link between early-life events and 
subsequent development of IBS, researchers 
need to understand the potential effects of 
disruptions to the microbiota in childhood. 
“If we look at the microbiome of human IBS 
patients and notice only modest changes, 
that’s not telling you what happened early 
on,’ says Cryan. “You could have changes 
from early life that potentially affected brain- 
gut signalling permanently.” 

There is also the possibility that the crucial 
element is how the microbiome functions, 
rather than what it comprises. This could help 
to explain why bacterial surveys of IBS have 
yielded such ambiguous results. “The same 
bacteria can exert different functions based 
on the environment and the food that we are 
supplying them with,” says Bercik. The upshot 
is that it may be more informative to focus on 
the microbiome’s biochemical output than on 
its composition. 

Despite the uncertainties in the evidence, 
there are intriguing findings from patients 
that lend weight to the hypothesis that the 
microbiome’s influence on gut-brain com- 
munication is relevant to IBS. A small study 
by Quigley and his colleagues’’ showed 
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that people with IBS who have psychiatric 
symptoms can be distinguished from those 
without, purely by differences in their micro- 
biota. Mayer's team is also continuing to pur- 
sue its brain-imaging work — preliminary 
results show that certain microbial profiles 
seem to correlate with changes in brain regions 
involved in pain perception and processing. 


CAUSE AND EFFECT 

What is still missing is solid evidence of gut 
microbiotal causality in IBS pathology. “We 
don't know yet what is the chicken and what 
is the egg,” says Mayer. Tinkering with patient 
microbiomes could help scientists to address 
this question. Mayer and his University of 
California, Los Angeles, colleague Kirsten 
Tillisch have found that healthy women who 
consumed a dairy drink containing probiotic 
bacteria had altered activity in brain areas asso- 
ciated with perception of emotions and of pain 
and discomfort". This is important evidence of 
microbiota-to-brain feedback, but the effect is 
yet to be investigated in patients. 

In 2015, Bercik and his colleagues reported 
preliminary data that showed that giving pro- 
biotics to patients significantly alleviated their 
depression — and provided a measure of IBS 
relief. And numerous studies indicate that such 
treatments can lessen gastrointestinal symp- 
toms more generally. “There's no question that 
probiotics have an effect,’ says Quigley. “What's 
more difficult to sort out is why they have an 
effect and which ones have what effect.” De 
Vos, together with gastroenterologist Robert 
Brummer of Sweden's Orebro University, is 
exploring the causality issue from the other end, 
with a clinical trial to assess the effects of trans- 
planting faecal bacteria from healthy donors 
into people with IBS. 

Even the microbiome champions 
acknowledge that a better microbial mix will not 
be the answer for every patient. But the consen- 
sus is that a clearer picture of the ‘good guys’ and 
‘bad guys’ of the gut could one day allow doctors 
to quickly dispatch appropriate reinforcements 
to the bacterial battlements, and break patients 
out of the self-destructive IBS cycle. m 


Michael Eisenstein is a freelance science 
writer based in Philadelphia, Pennsylvania. 
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PERSPECTIVE 
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gastrointestinal conditions encountered by clinicians — globally, 

it afflicts around 11% of people of all races, sexes and socio- 
economic status’. It is associated with recurrent abdominal pain 
linked with defecation, a change in bowel habits (constipation, 
diarrhoea or both), as well as other symptoms that do not seem to 
relate to the gut. 

Despite its prevalence, IBS remains frustratingly difficult to 
diagnose. The leading reason is the lack of a good biomarker. 
The presence of multiple, overlapping non-specific symptoms 
(abdominal pain, bloating, distension, diarrhoea and constipa- 
tion), related to the inherent heterogeneity of the disorder, compli- 
cates matters. Then there is the lack of understanding by clinicians 
about the condition, and the hesitance of patients and doctors to 
accept a symptom-based diagnosis. 

During the late 1980s, efforts to improve the diagnosis of IBS and 
to standardize the diagnostic process led to the development of the 
Rome criteria’. The third iteration, Rome III, 
was published in 2006 (ref. 3) and at the time 
was felt to represent the most up-to-date data 
on the diagnosis and treatment of IBS. Rome III 
defined IBS as recurrent abdominal pain or dis- 
comfort at least three days per month in the pre- 
ceding three months, which is associated with 
at least two of the following: symptom improve- 
ment with defecation, symptom onset coupled 
with altered stool frequency, or symptom onset 
coupled with altered stool form. 

During the past decade, however, it became 
clear that Rome III needed revision, partly 
because research improved our understand- 
ing of how IBS develops and of its underlying 
pathophysiological changes. But perhaps most 
importantly, because many clinicians found 
that the Rome III criteria were not as useful as 
they had hoped, and that the guidelines did not reflect real-world 
practice. For example, Rome III did not recommend basic labora- 
tory tests and ignored the fact that abdominal pain often wors- 
ens, rather than improves, with defecation. The criteria were also 
criticized for being mainly designed for researchers and not for the 
busy clinician, who frequently encounters patients exhibiting mul- 
tiple symptoms — both intestinal and non-intestinal. Health-care 
providers reported that some of the Rome III criteria were either 
vague or incorrect. And some researchers and clinicians argued 
that Rome III did not fully distinguish constipation-predominant 
IBS from chronic constipation, or diarrhoea-predominant IBS 
from other causes of diarrhoea. 

The new Rome guidelines‘ aim to address these criticisms and 
to provide clearer guidance for health-care providers based on the 
latest clinical evidence. Published in May, Rome IV covers many 
functional bowel disorders, of which IBS is the most prevalent. It 
is the result of an exhaustive review of the literature and nearly 
three years of work by international specialists, including a team 
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CRITERIA SHOULD 
ENHANCE CLINICIANS’ 


KNOWLEDGE 
OF THIS DISORDER 
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DIAGNOSE IT. 


An easier diagnosis 


The latest iteration of the Rome criteria should improve diagnosis of 
irritable bowel syndrome, says Brian E. Lacy. 


led by myself and Fermin Mearin of the Centro Médico Teknon in 
Barcelona, Spain, that focused exclusively on IBS. 

Rome IV delivers several important changes in the diagnostic 
criteria for IBS. Gone is the term abdominal discomfort — the word 
discomfort is imprecise and is not present in every language. The 
criteria also require that abdominal pain is present at least one day 
per week on average during the preceding three months, up from 
just three days per month. This was based on a survey conducted 
on behalf of the Rome committee, which found that pain is the 
cardinal symptom of IBS*. These new criteria also recognize that 
IBS is often associated with irregular bowel habits — constipation, 
diarrhoea or a mix of both — and that common symptoms include 
bloating and distension. For a condition to be classified as IBS, it 
must be chronic, reflected by the addition that symptoms should 
have persisted for at least six months. The Rome criteria also now 
acknowledge the role of diagnostic tests to exclude other com- 
mon conditions, such as coeliac disease, lactose intolerance and 
inflammatory bowel disease, which have simi- 
lar symptoms to IBS. 

By clarifying the language, updating the 
definition and including the option of labo- 
ratory testing, the new criteria should make 
it easier for doctors to diagnose IBS. The 
increased focus on abdominal pain reinforces 
what is reported by the clinicians who rou- 
tinely care for these patients — that abdominal 
pain is the essential element of IBS. Rome IV 
will also help to distinguish people with IBS 
from those who have intermittent abdominal 
spasms or cramps, as well as from those who 
have chronic constipation or diarrhoea. 

Tremendous advances have been made in 
our understanding of IBS over the past dec- 
ade. Although it is a chronic disorder for many, 
and not yet curable, new therapeutic options 
have significantly improved the symptoms of many patients with 
IBS. The release of the Rome IV criteria should enhance clini- 
cians’ knowledge of this common disorder and allow them to more 
reliably diagnose it. In research laboratories too, a more refined 
definition will help scientists to identify and categorize people with 
IBS and its subtypes, improving our understanding of the complex 
physiology of the condition, and in turn, spur researchers to make 
even greater therapeutic discoveries. m 


Brian E. Lacy is Chief of the Division of Gastroenterology & 
Hepatology at the Dartmouth-Hitchcock Medical Center, New 
Hampshire. He is also co-Editor-in-Chief of The American Journal of 
Gastroenterology. 
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OURAN S IRRITABLE BOWEL SYNDROME 


Although both considered healthy snacks, oranges are lower in FODMAPs than apples. 


Food for thought 


Many people with irritable bowel syndrome feel that they 
benefit from dietary interventions, but researchers still lack 
a full understanding of how food can inflame this disorder. 


BY SARAH DEWEERDT 


r here is no doubt that people with irritable 
bowel syndrome (IBS) and the doctors 
who treat them are obsessed with diet. 

The past five years have seen a huge rise in 

awareness of the low-FODMAP diet, which 

omits particular carbohydrates found in wheat, 
onions and some fruits. Other patients are eager 
to go gluten free, eschewing a protein that is 
found in wheat and some other grains — a diet 
that is also popular outside the IBS community. 

“Patients have high expectations” for such diets, 

says Magnus Simrén, a gastroenterologist at the 

University of Gothenburg in Sweden. 

The idea that food causes IBS symptoms has 
intuitive appeal to many. In surveys, 40-90% of 
people with IBS say that their symptoms have 
dietary triggers. “Patients have been telling us 
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for many, many years that their symptoms are 
associated with eating a meal,” says William 
Chey, a gastroenterologist at the University of 
Michigan in Ann Arbor. These people have 
historically had few options for reliable phar- 
maceutical treatments. 

But we have been down this road before, 
and earlier interest in dietary interventions 
for IBS failed to have a lasting effect on 
patients’ symptoms. Even now, many scien- 
tists caution that we do not know enough 
about the relationship between food and IBS. 
And although low-FODMAP and gluten-free 
diets are gaining traction, researchers say that 
a more rigorous, sceptical attitude is needed 
to ensure that the current interest in diet leads 
to lasting clinical gains. 

Forty years ago, the prevailing theory on 
IBS was that symptoms were triggered by food 
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intolerance or allergies. In the early 1980s, John 
Hunter, a gastroenterologist at Addenbrooke's 
Hospital in Cambridge, UK, reported success 
with a diet that excluded most common food 
allergens, including those found in milk, eggs 
and nuts. 

This approach hit a dead end, however, when 
other clinics had difficulty replicating Hunter's 
results. Furthermore, numerous studies found 
that people with IBS had no higher prevalence 
of food allergies than the general population. 
By the 1990s, diet had largely been abandoned 
as a treatment for IBS, with the focus switching 
instead to pharmaceutical therapies. 

But in the past decade, the pendulum has 
swung back, pushed both by patients’ frustra- 
tion with underwhelming drugs and by new 
research tools for investigating dietary triggers 
of symptoms. “We have a much better under- 
standing now of all the different ways in which 
food might cause gastrointestinal problems,’ 
Chey says. 


FOOD TRIGGERS 

Working out which foods are implicated in 
causing IBS symptoms — let alone which 
ones trouble an individual patient — is tricky, 
however. “Food contains all sorts of different 
components,’ says Kevin Whelan, a dietician 
at King’s College London. Any individual food 
item is chemically complex, he says — and foods 
are rarely eaten in isolation. 

But researchers got a big break in the late 
2000s. A team at Monash University in Mel- 
bourne, Australia, led by gastroenterologist 
Peter Gibson and dietician Sue Shepherd, 
conducted some of the most rigorous studies 
yet of the dietary triggers of IBS. Their work 
centred on FODMAPs (fermentable oligo- 
saccharides, disaccharides, monosaccharides 
and polyols), which are carbohydrates that the 
body has difficulty breaking down or absorb- 
ing. They include: fructose, the form of sugar 
found in fruit; lactose, which is found in milk; 
fructans, which are polymers of fructose found 
in wheat, legumes and onions; and alcohol (see 
‘FODMAP food map’). 

Many FODMAPs had previously been 
implicated in IBS. “We've known that they cause 
problems,’ says Gibson. “But no one had ever 
put them together as a family that all have the 
same effect.” 

In one persuasive study’, Gibson’s team 
put people with IBS on a diet that drastically 
reduced their consumption of FODMAPs. 
Their symptoms improved. Then the research- 
ers fed them fructose, fructans or a placebo. 
Symptoms returned in people given FODMAPs, 
but not in those who received the placebo. 
“They really have shown that FODMAPs can 
induce symptoms in patients with IBS,” says 
Simreén. 

Magnetic resonance imaging (MRI) studies 
of the digestive system have shown how 
FODMAPs cause problems. Poorly absorbed 
molecules such as fructose draw more water 
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into the small intestine, stretching it. Poorly 
digested FODMAPs, such as fructans, are 
fermented by gut bacteria, producing excess gas 
in the large intestine. These effects also occur 
in people without IBS, but the condition is also 
associated with visceral hypersensitivity, so even 
a small distension of the intestine can be painful. 

In addition, in some people with IBS, food 
passes through the gut more quickly than 
normal. In these individuals, any excess water 
drawn into the small intestine will not have time 
to be reabsorbed in the large intestine, result- 
ing in diarrhoea. “A significant proportion of 
people with IBS will benefit from reducing their 
FODMAP intake,” says Whelan. He and other 
proponents of the low-FODMAP diet say that it 
can help up to 70% of people with IBS. 


HIT THE FLAW 
Not everyone is convinced, however. Some 
researchers argue that many of the studies pur- 
porting to show the benefits of a low-FODMAP 
diet have methodological flaws — difficulties 
that are commonly found in dietary studies. 

The main problem is that people are aware of 
what they are eating, which means that unlike 
in studies of experimental drugs, it is difficult 
to blind participants to whether they are in the 
treatment or control group. This makes these 
studies more vulnerable to both placebo and 
nocebo effects (in a nocebo effect, the thought 
of eating a trigger food could bring on negative 
symptoms). “Expectations play a big role here,” 
Simreén says. 

To address this problem, Simrén’s team tried 
to take expectations out of the equation. In 
2015, they published a study’ of 75 patients 
who were randomly assigned to follow either 
a low-FODMAP diet or conventional dietary 
advice for IBS, which focuses on avoiding 
large meals, fat, insoluble fibre, caffeine and 
gas-producing foods such as beans. Patients 
were told that both diets had a good chance 
of helping them but, crucially, the researchers 
never used the term ‘“FODMAP. This was the 
largest randomized trial of the low-FODMAP 
diet, and the first to directly compare it with 
another active intervention. 

After four weeks, there was no difference 
between the groups — about half of the people 
on each diet reported improve- 
ment in their symptoms. The 


researchers concluded that eel) ‘AP 
the low-FODMAP diet was as Hicrienot 
effective for IBS as the estab- ; 

lished diet. But Paul Moayyedi, fife.” for 


a gastroenterologist at McMas- 
ter University in Hamilton, 
Canada, calls the results “completely negative’, 
as the low-FODMAP diet offered no advantage 
over previous dietary advice. 

Such disagreement about how to interpret 
dietary studies is common in the IBS commu- 
nity. “This is evidence that you actually have 
dietetic interventions that work,’ says gastroen- 
terologist David Sanders of Sheffield Teaching 


FODMAP FOOD MAP 


Some of these foods are prohibited on a diet that is low 
in the indigestible carbohydrates known as FODMAPs, 
but which ones? (Answer in the margin.) 
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Chicken 
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Hospital in the United Kingdom. Both the low- 
FODMAP diet and the conventional IBS diet 
give a 50% chance of improvement without 
using drugs — and keep the patient in control. 
“Tt's wonderful, isn't it, to have that?” he asks. 

But proponents and sceptics do agree that 
the low-FODMAP diet should be supervised 
by a health-care professional. Without proper 
advice, patients who give up high-FODMAP, 
but otherwise healthy foods, such as apples, 
artichokes and lentils, may struggle to have 
adequate fibre, vitamins and minerals in their 
diet. And some FODMAPs also encourage the 
growth of beneficial bacteria. 

“The low-FODMAP diet removes important 
prebiotics from the diet and has the potential 
to cause really quite large perturbations in the 
balance of the gut microbiome,” says Miranda 
Lomer, a dietician at King’s College London 
who collaborates with Whelan. 

For these reasons, avoiding all FODMAPs 
should only be a short-term undertaking. “A 
low-FODMAP diet is not a diet for life,” says 
Gibson. And it should not be adopted lightly. 
But the proliferation of dietary information on 
the Internet means that, in practice, this does 
not always happen. 

Peter Whorwell, a gastroenterologist at the 
University of Manchester, UK, estimates that 
half of his patients have put themselves on a 
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low-FODMAP diet without taking medical 
advice.“That is quite worrying,’ he says (see 
page S112). 


DIET DILEMMA 

Low-FODMAP is not the only diet of interest to 
the IBS community. To try to relieve their symp- 
toms, many people with IBS go gluten free — 
some studies have shown this to be effective in 
up to 70% of individuals with IBS. 

But the role of gluten in IBS is unclear, and 
most ‘gluten-free’ diets are really wheat-free 
diets, so the culprit could be other com- 
pounds found in wheat. Some studies point to 
a-amylase trypsin inhibitors, or say that people 
may be benefiting from reduced consumption 
of insoluble fibre, which is common in whole- 
grain breads and other cereal products. In 1994, 
Whorwell found that fibre worsened IBS symp- 
toms in more than half of patients. He suspects 
that fibre causes problems through similar 
mechanisms to FODMAPs, being fermented 
by gas-producing bacteria in the colon. 

Or maybe it’s all down to FODMAPs. Wheat 
contains fructans and is one of the most fre- 
quently eaten FODMAP-containing foods. In 
one study’, Gibson's team cycled people with 
IBS through periods on low-FODMAP, low- 
gluten and high-gluten diets. Their symptoms 
diminished only during the low-FODMAP 
phase. “We could find no evidence of any 
specific response to gluten,” he says. 

Whatever the reason for its success, the wider 
popularity of the gluten-free diet, particularly in 
the United States and Australia, makes it clini- 
cally appealing. “It’s easier to buy gluten free, it’s 
less of a restrictive diet, and it doesn't require as 
much dietetic support,’ says Sanders. He adds 
that some of his patients who improve on a low- 
FODMAP diet eventually go gluten free instead 
because it's easier. 

Eventually, scientists hope to identify 
biomarkers in the blood or the gut microbiota, 
or possibly both, to determine which dietary 
factors a particular patient is sensitive to, and 
what dietary changes might ease their symp- 
toms. But such an analysis is probably years 
away. In the meantime, those who treat IBS 
tend towards pragmatism. Even Moayyedi, who 
is sceptical of the lhw-FODMATP diet, recom- 
mends that some of his patients give it a go. He 
says that it helps about one-quarter of the time. 

Simrén says that despite little being known 
about the effects of gluten, he does not argue 
when patients say that gluten-free diets help. “T 
tell them, ‘I don’t quite know why you are better, 
but I'm happy that you are>” he says. “And if you 
feel that much better, you can continue.” = 


Sarah DeWeerdt is a freelance science writer 
based in Seattle, Washington. 
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Filling in the missing pieces 


For years, researchers have tried to find biomarkers that could aid the diagnosis of irritable 
bowel syndrome, and point to its underlying causes. But will one test ever be enough? 


BY ALLA KATSNELSON 


launched Mark Pimentel’s interest in irri- 

table bowel syndrome (IBS). Back then, 
despite the prevalence of the condition, people 
with IBS were often told that their symptoms 
were all in their heads. But Pimentel, then a gas- 
troenterology fellow at Cedars-Sinai, a medical 
center in Los Angeles, California, focused else- 
where in their bodies, on the rumblings in the 
abdomen: bloating, distension and flatulence 
— in short, gas. 

Pimentel wondered whether bacteria 
fermenting food in the gut were producing an 
excess of gas. In 2000, along with his mentor, 
Henry Lin, Pimentel reported that almost 80% 
of people with IBS harboured an overgrowth of 
bacteria in their small intestines’. For about half 
of these patients, eradicating the extra bacteria 
improved symptoms. Around that time, other 
groups were also making bacterial connec- 
tions — linking food poisoning to IBS. In the 
same vein, when Pimentel and his colleagues 
infected rats with the bacterium Campylobacter 
jejuni, acommon cause of travellers’ diarrhoea, 
they found that the animals developed not only 
IBS-like symptoms, but also similar bacterial 
overgrowth’. That’s where things began to get 
interesting, says Pimentel, who now directs 
the gastrointestinal-motility programme at 
Cedars-Sinai. “From that point on, we said, 


I was a hunch almost two decades ago that 
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‘OK, something in these bugs is causing IBS” 

The culprit, Pimentel says, is cytolethal 
distending toxin B (CdtB), a key part of a toxin 
produced by pathogens that cause gastroen- 
teritis. Last year, Pimentel reported that in 
a model of IBS, rats infected with C. jejuni 
formed antibodies to CdtB, which in turn trig- 
gered an autoimmune response against cells 
of the gut wall’. Furthermore, he found that 
blood-plasma levels of CdtB antibodies could 
help to distinguish people with diarrhoea- 
predominant IBS (IBS-D) from those with 
inflammatory bowel disease or coeliac dis- 
ease and from healthy controls*. With these 
results in hand, Pimentel and diagnostics com- 
pany Commonwealth Laboratories of Salem, 
Massachusetts, developed a blood test called 
IBSchek. “We've never had a biomarker that 
says, if youre positive, you have IBS,’ Pimentel 
says. “ This is that test.’ 

People with IBS need a diagnostic test. 
Patients get the label only when diseases that 
have similar symptoms, such as Crohn's dis- 
ease, coeliac disease or colorectal cancer, are 
excluded. The current system can leave peo- 
ple in diagnostic limbo — the average time 
to diagnosis is 6.6 years’, during which they 
shuttle between doctors and undergo mul- 
tiple tests, and still end up not quite being 
able to say that they have a disease (see page 
S112). Multiple attempts have been made to 
identify markers that could provide a positive 
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diagnosis, but with limited success. In 2009, a 
team that included Anthony Lembo, a gastro- 
enterologist at Beth Israel Deaconess Medical 
Center in Boston, Massachusetts, who went 
on to be a co-author of the 2015 CdtB-anti- 
body clinical study, launched a 10-biomarker 
blood test for IBS with pharmaceutical com- 
pany Prometheus Laboratories. But the test 
is no longer being sold. The main problem, 
says Lembo, was that the markers in the panel 
werent related to what caused IBS. 

Pimentel says that his test does just that 
— at least for IBS-D, which about one-third 
of people with the condition have. But not all 
gastroenterologists are convinced that IBSchek 
is as widely applicable as its developers say, or 
that it represents the underlying cause of the 
disorder. IBS is extremely heterogeneous, and 
so the test requires more validation in differ- 
ent groups of patients, critics say. Meanwhile, 
some researchers are focusing on this hetero- 
geneity to identify the underlying physiologi- 
cal differences that could separate IBS into 
distinct, diagnosable conditions. 


GREAT UNCERTAINTIES 

The term IBS was adopted in the 1960s, but 
even then clinicians struggled to define the 
condition (see page S102). “It had all these 
weird names and groupings, and there was 
no consensus about what these people expe- 
rienced,’ says Lembo. Today, the condition is 


SARAH J. COLEMAN 


diagnosed on the basis of symptoms using the 
Rome criteria (see page S107). 

The conventional diagnostic categories are 
functional: constipation-predominant IBS 
(IBS-C), IBS-D and mixed IBS. “That dis- 
tinction has nested itself into clinical practice 
because we have drugs that work specifically 
for IBS-D and IBS-C,’ says Jan Tack, a gastro- 
enterologist at the Univeristy of Leuven, Bel- 
gium. “But you could subdivide these people 
in many, many ways.’ For example, he says, IBS 
cases could be distinguished by whether they 
begin acutely or appear gradually, whether 
the predominant issue is pain or bloating, or if 
psychosocial symptoms such as anxiety are pre- 
sent. Whether and how any of these groupings 
align with physiological changes is not clear, 
but they generally warrant different treatment, 
says Tack. 

IBS symptoms seem to be driven by a tangle 
of factors that include irregularities of the 
central nervous system, problems in gut per- 
meability, impairment 
of the immune system 


“We are not at 
or a microbial imbal- ; 
ance in the gut (see page ohn oint to pi a 
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tify biomarkers that this test and 
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to detect inflammation, 

serotonin as a way to reveal an out-of-sync con- 
nection between the gut and the brain, and non- 
molecular measures such as gut hypersensitivity 
and brain imaging have all been explored. Even 
within a subgroup such as IBS-D, there seems 
to be a lot of variation in what might be causing 
the symptoms (see page $116). 

But researchers are now going beyond 
IBS-C and IBS-D, albeit slowly, and identi- 
fying subtypes that have a clear relationship 
with the underlying biology — and hence 


with biomarkers, says Magnus Simrén, 
a gastroenterologist at the University of 
Gothenburg in Sweden. For instance, IBS-D 
can arise when bile acid, which is produced by 
the liver to aid digestion, fails to be absorbed 
in the small intestine and ends up in the colon. 
This bile-acid malabsorption can be identified 
with the SeHCAT test, which is approved in 
Europe, but not in the United States. The con- 
dition responds well to targeted drugs, mak- 
ing a precise diagnosis worthwhile. “This is the 
way we can go today,’ says Simrén, “to define 
subgroups of IBS patients with biomarkers.” 

But Pimentel contends that his IBSchek 
is more widely relevant. Current estimates 
suggest that 10-20% of IBS-D cases follow 
gastroenteritis, but this is based on studies that 
asked people to recall past incidences of stom- 
ach upset. “Two days of mild food poisoning 
— you may not even remember that 20 years 
later, now that you have IBS,” Pimentel says. 
He suspects that 50-70% of IBS-D cases are 
post-infectious. 

Many IBS researchers acknowledge that 
current estimates of post-infective IBS may be 
low, but they disagree that it could be as high 
as Pimentel’s estimate. “To flip the coin com- 
pletely and say that the majority must be post- 
infectious — we would need much better data 
for that,’ says Tack. 

Pimentel’s hypothesis that an antibody 
generated by a bacterial toxin has autoimmune 
properties is sound, and well-supported by 
animal studies, Tack says. But he is not yet con- 
vinced that this antibody causes IBS, saying that 
there is no independent evidence that the par- 
ticipants in Pimentel’s 2015 study really did have 
post-infectious IBS. The CdtB antibody, he says, 
might be a kind of bystander marker: present, 
but not necessarily causative. People with IBS-D 
may have increased gut permeability, a condi- 
tion that allows all kinds of macromolecules to 
get into the submucosal tissue more easily and 
trigger an antibody response — no infection 
required. And Simrén notes that the IBSchek 
antibodies were found in people who had had 
the condition for a long time; whether the mark- 
ers can be detected when patients first present 
with symptoms is still to be demonstrated. 

Another issue with IBSchek is that the test is 
negative for about half of people who do have 
IBS, says Lin Chang, a gastroenterologist at the 
University of California, Los Angeles. What's 
more, she says, although IBSchek was able 
to differentiate IBS from ulcerative colitis, it 
was less successful with Crohn's disease and 
coeliac disease. Researchers such as Chang 
and Tack say that wider testing is needed 
before large-scale commercial deployment 
of IBSchek. Preferably, this should involve 
independent researchers working in a clini- 
cal setting with large groups of new patients 
presenting with diarrhoea and abdominal 
pain, and in geographically diverse popula- 
tions. “We need to see more extensive data,’ 
Tack says. “To tell a clinician now, “Use this 
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test and it will solve some of your problems. I 
don't think we're there yet.” 

Pimentel dismisses this concern. The 
antibody study, he points out, was conducted 
in patients from 180 medical centres and was 
larger than studies of many widely used diag- 
nostics. Moreover, he notes, there are several 
studies under way in different populations, and 
at least one, conducted by military researchers, 
has corroborated his findings’. 


BIG WORK 

Because IBS is such a heterogeneous condition, 
it will inevitably take large studies with thou- 
sands of people to understand the mechanisms 
of the condition — and hence how to diagnose 
it. In June, the US National Institute of Diabetes 
and Digestive and Kidney Diseases will lay out 
such a road map in their Functional Bowel Dis- 
orders Workshop, says Chang. The pan-Euro- 
pean network GENIEUR was launched in 2012 
to identify the factors that contribute to IBS. 

Beate Niesler, a molecular geneticist at the 
University Hospital of Heidelberg, Germany, 
who heads GENIEUR along with Simrén, 
says that the fact that each IBS drug only 
helps a small number of people indicates 
that “there are obvious sub-groupings of 
patients”. There are several early clues that 
genetics and epigenetics could help to parse 
these groups. Associations with IBS have 
been found in genes related to serotonin sig- 
nalling and immune, neuronal and intestinal 
barrier function. But most of the studies were 
small and the results may have been skewed 
by the different capabilities of each centre, 
Niesler says. She believes that GENIEUR, 
which spans 27 countries, can tease out and 
validate the various pathways by match- 
ing detailed data of clinical symptoms and 
phenotype with in-depth molecular analysis. 
The consortium has spent the past few years 
hammering out protocols, and now plans to 
conduct gene and RNA sequencing, analyse 
epigenetic markers and sequence the gut 
microbiome of 5,000 patients. 

Almost all IBS researchers agree on the 
need to achieve a clearer understanding of the 
biological basis of this condition. This would 
allow patients to be given a firm diagnosis 
and, in turn, treatment. Pimentel is pushing 
towards this goal using the markers that he 
says reflect the underlying cause of IBS. “Now 
that we know that this antibody is so impor- 
tant,’ he says, the question is, “if we can get rid 
of it, can we cure IBS?” = 


Alla Katsnelson is a freelance science journalist 
based in Northampton, Massachusetts. 
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Q&A Peter Whorwell 
A listening ear 


Peter Whorwell is Director of the South Manchester Neurogastroenterology Service and a gastroenterologist at the University Hospital of South 
Manchester, UK. He has been researching and treating patients with irritable bowel syndrome (IBS) for more than 35 years, and has seen the depths 


of despair that this condition can cause. 


What’s it like to have IBS? 

When I was a medical student in the 1970s, 
we were told that people with IBS were 
oversensitive females and that they shouldnt 
be taken too seriously. This erroneous per- 
ception of IBS has unfortunately persisted. 
But the truth is that IBS is a genuine condi- 
tion that affects millions of people, men as well 
as women. It’s actually quite common: about 
15% of adults in the United Kingdom have 
it. Typically, patients have abdominal pain, 
bloating and bowel problems: some people are 
constipated, others have diarrhoea, some flit 
between the two. But they may also experience 
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backache, fatigue or nausea, and have bladder 
and/or gynaecological problems. For some 
people, the symptoms are manageable, but for 
others they can be absolutely devastating. 

I see the latter, patients with very severe 
IBS. Half of the people who come to my clinic 
have faecal incontinence. Some nights, they 
wake up in a pool of faeces. Women can have 
incontinence during intercourse. Sex is pain- 
ful. Sometimes the bloating is so bad that they 
look heavily pregnant and the pain is so severe 
that they liken it to labour, but at least labour 
comes to an end. Over time, their sense of their 
own femininity is eroded. For some people, the 
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symptoms are so severe and restrictive that 
they can't leave the house. The sick days rack 
up and employers are unsympathetic. Some of 
my patients who also have cancer tell me that 
having IBS is worse. People with IBS can feel 
totally hopeless — and still they are told that 
their symptoms are all in their head. 


What happens when someone with these 
symptoms goes to see their doctor? 

The first interaction is absolutely crucial. It can 
set a patient on a good or a bad journey through 
the health-care system. What I have observed in 
the United Kingdom and many other countries 


GRANT WHITEHURST, UNIV. HOSP. SOUTH MANCHESTER 


is that, because general practitioners (GPs) 
are pushed for time, when someone with IBS 
describes their range of symptoms the doctor 
tends to focus on the one that is most troubling. 
Ifit's bladder problems, for example, they might 
prescribe antibiotics. If it’s menstrual problems, 
they might send them to a gynaecologist. The 
result is that the IBS goes unrecognized, and 
treatments given to ease these other symptoms 
can sometimes backfire. Antibiotics, for exam- 
ple, can make IBS worse. Surgical interven- 
tions can aggravate the already hypersensitive 
gastrointestinal tract. 

In the worst cases, patients end up 
ricocheting between their GP and various 
specialists. A person can end up having mul- 
tiple scans, colonoscopies and gastroscopies, 
all of which help to exclude serious diseases 
such as an ulcer, Crohns disease or cancer, but 
which leave the patient no closer to a defini- 
tive diagnosis or to having their symptoms 
eased. At this point, they are often told either 
that there is nothing wrong with them, or that 
it’s probably IBS and that they need to learn to 
live with it. It’s only after a lot of back and forth 
that patients eventually end up seeing someone 
like me, who specializes in severe cases of IBS. 


Why is it so difficult to get a definitive 
diagnosis of IBS? 

Doctors are programmed to do tests and to 
base their diagnoses on the results. But there 
is no test for IBS. It is a collection of symptoms 
—asyndrome, not a disease (see page $110). I 
dont do tests. I listen. I sit there and I let people 
talk. Then I piece together their symptoms and 
make a clinical judgement. Not everyone has 
the confidence to do this. 

Sometimes people with IBS do receive a 
diagnosis from a GP or a specialist, but it’s 
rarely unequivocal and the treatments that fol- 
low are not always successful. For example, my 
colleague Carol Francis, who now works at the 
Countess of Chester Hospital, and I showed 
that fibre-rich diets make only 10% of peo- 
ple with IBS feel better; 55% felt worse (C. Y. 
Francis & P. J. Whorwell Lancet 344, 39-40; 
1994). Yet half of all doctors still preach the 
myth that people with IBS should eat more 
fibre. Sometimes medication is prescribed. 
Antispasmodics can alleviate pain by relaxing 
the bowel muscle, and laxatives and antidiar- 
rhoea medication can be used to control bowel 
symptoms. Old-fashioned tricyclic antide- 
pressants, which also have an analgesic effect, 
can help, but by the time they are offered, peo- 
ple with IBS are so used to hearing that their 
illness is all in their head that often they don't 
want to take them. 


How do you help people who have severe IBS? 

I am a tertiary-care specialist. GPs and 
secondary-care specialists send me people 
who have extreme and persistent IBS. I see 
about 2,000 patients a year. Many of them 
will have been stuck in the health-care system 


for 5 to 10 years. These are the people who 
are affected so badly that they are ceasing to 
function socially and in the workplace; 38% 
of them have even contemplated suicide 
(V. Miller et al. Clin. Gastroenterol. Hepatol. 
2, 1064-1068; 2004). 

By the time they find me, they are desperate 
for a diagnosis, which I can usually give them 
in our first session. I tell them I cannot cure 
them, but that I can help them if they help me 
by being engaged and involved. 

I start by putting them on a very strict diet. 
The FODMAP diet, which excludes certain 
non-digestible carbohydrates, is currently very 
popular for IBS (see 


page S108).I prescribe “For some 

adiet thatsevenmore people, the 
extreme and restric- symptoms are 
tive than FODMAP.  ggsevere and 
Under my guidance, restrictive that 
the patient gradually they can’t leave 


reintroduces foods to 
see what can be toler- 
ated. It’s an approach that helps about half of 
my patients. 

Next, I look at medication. I try different 
drugs, doses and combinations. If I suggest a 
tricyclic antidepressant, I explain that the low 
dose being offered is not for depression — most 
of my patients are not depressed, which is amaz- 
ing when you think how awful their symptoms 
are — but to reduce pain. Amitriptyline, for 
example, helps more than half of my patients. 


the house.” 


Can hypnotherapy help? 

Yes, it can. That's why we offer this treatment 
after tinkering with prescriptions and diet. I 
have a team of specialized hypnotherapists 
who work with me. Last year, we reported that 
of 1,000 people with hard-to-treat IBS, 76% 
experienced a positive effect with hypnother- 
apy, with symptom severity reduced by half 
(V. Miller et al. Aliment. Pharmacol. Ther. 41, 
844-855; 2015). This study adds to a catalogue 
of more than two dozen papers that we have 
published showing that hypnotherapy has a 
positive effect for people with IBS. 


Doesn’t the use of hypnotherapy reinforce the 
idea that IBS is all in the mind? 

Not at all. This isn’t some sort of mind-control 
stage hypnotism. Gut-focused hypnotherapy, 
as we call it, is about inducing a state of deep 
relaxation and teaching people to take control 
of their very real, physiological symptoms. 
Hypnosis has been shown to reduce symptoms 
such as gastrointestinal sensitivity, the strength 
of muscular contractions in the gut and gas- 
tric-acid secretion in people with IBS. We use 
imagery. If someone feels bloated, we might 
ask them, under hypnosis, to imagine a bal- 
loon being deflated. If someone is constipated, 
we might ask them to visualize a river flowing 
slowly and then speed it up. The hypnothera- 
pists at the centre leave patients with post-hyp- 
notic suggestions. If the patients start to feel 
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pain, for example, the simple act of laying their 
own hands on their belly should help to ease 
it. We're even starting to give hypnotherapy 
through Skype now. 


Does this type of alternative therapy have any 
drawbacks? 

As the technique becomes better known, more 
people are offering it as a treatment, and they 
sometimes do so without any qualifications or 
knowledge about IBS. That worries me. Some 
even claim to have been trained by me, but I’ve 
never met them. We're giving hypnotherapy a tre- 
mendous reputation that charlatans can live off. 


What other treatments for IBS are there in the 
pipeline? 

We're hoping to run a trial for faecal microbial 
transplants in the neurogastroenterology unit 
next year. Some people with IBS are thought 
to have an imbalance in their gut microbiota 
(see page S104). Wipe out that community of 
microbes and replace it with a healthy cocktail 
of organisms, the argument goes, and IBS may 
improve. In the United States, there are a few 
anecdotal reports of people with IBS whose 
symptoms have improved after faecal trans- 
plants, so we think it is worth trying. We're still 
very much at the formative stage and have yet 
to apply for ethical approval. My fear, however, 
is that we may be rejected because regulators 
regard IBS as a trivial disorder. 


What’s your assessment of the future of 
understanding and treating IBS? 

I’m very gloomy about the future. IBS is the 
most common gastrointestinal disorder, and yet 
it is still heavily stigmatized. Patients feel that 
they get brushed off by the medical commu- 
nity, which considers IBS to bea mild, non-life- 
threatening condition. Research funding is hard 
to come by, and regulators are reticent to fund 
trials for therapies, because it is ‘only IBS. But 
IBS does kill people; they take their own lives. 
That it’s only IBS is a dangerous misperception 
that has to change. 

My hospital has dozens of respiratory 
physicians, cardiologists and all sorts of other 
specialists, but just one IBS specialist — me. 
I worry about what will happen when I, and 
others like me, retire. Two of the five hypno- 
therapists I work with have retired and are 
not being replaced. Young gastroenterolo- 
gists don't want to do what I do — they want 
to investigate people with endoscopes. What 
I do mainly involves listening to people and 
encouraging them, by a process of trial and 
error, to manage their condition. I am usually 
able to help them — help them to leave the 
house, help them to return to work and help 
them to get their life back. I hope that this kind 
of approach is able to continue in the future, 
because it is amazing how often it works. = 


INTERVIEW BY HELEN PILCHER 


This interview has been edited for length and clarity. 
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Svartediket reservoir (right) was the source of a 2004 giardiasis outbreak in Bergen, Norway. 


INFECTIOUS DISEASE 


Something in the water 


Infections have long been thought to cause irritable bowel syndrome, but it has not been 
clear how. It seems that pathogens might be changing the behaviour of our gut microbiota. 


BY SUJATA GUPTA 


primary-care physician in Bergen, Norway, 

was concerned about a noticeable rise in 
the number of patients complaining of per- 
sistent diarrhoea, flatulence and nausea. She 
began discussing the problem with colleagues 
over lunch breaks, but it wasn't until she got a 
call from the doctor across the road who had 
patients with the same symptoms that she 
really began to worry. 

That doctor, Knut-Arne Wensaas, told 
Rortveit that one of his patients had tested 
positive for the protozoan parasite Giardia 
lamblia. People become sick with giardiasis, as 
the resulting disease is called, when they con- 
sume contaminated food or water. A common 
cause of travellers’ diarrhoea, Giardia is rare in 
developed countries such as Norway, where it 
is almost always related to travel overseas. 

But Wensaas’ patient had not left the country 
or done anything else to put herself at risk. 


lE October 2004, Guri Rortveit, a 
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Concerned, Wensaas and Rortveit asked the 
other nine doctors in their practices to request 
microscopic analyses of stool samples from 
all of their patients with diarrhoea. Realizing 
that something was amiss, the lab workers who 
processed the requests at the Haukeland Uni- 
versity Hospital began testing other stool sam- 
ples to see if these also contained the parasite. 

On 29 October, the hospital notified city 
officials of a giardiasis outbreak. Within days 
the source was traced to one of the city’s main 
reservoirs, located some 3 kilometres from 
Rortveit’s office. Local residents were advised 
to boil their tap water, and the outbreak was 
finally contained. 

About 1,300 people were diagnosed with 
giardiasis, and there were another 1,200 uncon- 
firmed cases. In about half of these 2,500 people, 
the symptoms persisted for months, sometimes 
after repeated treatment, even though tests 
showed that the parasite was gone. “We knew 
then that there was something else going on,” 
says Rortveit, who is also a family medicine 
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researcher at the University of Bergen. 

The Bergen case added to a body of work 
showing that infections can trigger a form 
of irritable bowel syndrome known as post- 
infective (PI) IBS, which represents 6-17% 
of all IBS cases. There are several hypotheses 
about how this happens, but it seems that 
infections can lead to long-term changes in 
the gut. Understanding how IBS develops 
could provide clues to how some cases might 
be treated, or even prevented. 


LINGERING ILLNESS 
Even before IBS had a name, it was suspected 
that infection could trigger prolonged gastro- 
intestinal problems (page S102). In his 1918 
second edition of Medical Diseases of the War, 
British physician Arthur Hurst wrote: “Patients 
who have recovered from an acute attack of 
dysentery frequently remain unfit for a consid- 
erable period, which may even extend to years.” 
Over the next few decades, evidence slowly 
accrued for a link between infection and 
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chronic bowel problems. Then, in 1962, Sidney 
Truelove and Nazir Chaudhary of the University 
of Oxford, UK, studied 130 patients with irri- 
table colon syndrome, as IBS was then called. 
They found that one-quarter of patients had 
developed chronic bowel problems after a bout 
of gastroenteritis (which many acquired while 
they were serving overseas during the Second 
World War). 

This finding was largely ignored until the 
1990s, when several researchers started inves- 
tigating the phenomenon in earnest. Nick Read 
and Kok Ann Gwee, both then at the University 
of Sheffield, UK, were intrigued by the obser- 
vation that those who developed PI-IBS were 
more likely to have psychological problems than 
those who recovered from their gastroenteritis. 

Meanwhile, Robin Spiller, a gastroenterologist 
at the University of Nottingham, UK, was inter- 
ested in how pathogens 
triggered PI-IBS, and 


whether this was distinct IBS was 

from other forms ofthe °UPP osed to 
condition. Spiller’s team be acondition 
studied 544 people who where there 
had confirmed diar- Weren tany 
rhoeal infection and Clianges inthe 
found that one-quarter Sul. ° 


said that their bowels 

had not returned to normal after 6 months. In 
patients who developed IBS, the most com- 
mon pathogens were bacteria from the genus 
Campylobacter (R. C. Spiller et al. Gut 47, 
804-811; 2000). Furthermore, rectal biopsies 
revealed an increase in T lymphocytes that 
lasted at least three months after the infection 
had passed — a sign of a prolonged inflamma- 
tory response. The inflammation seemed to 
increase the number of serotonin-containing 
cells in the gastrointestinal tract, which is a 
potential trigger for diarrhoea. 

When Spiller’s team published this work, the 
prevailing wisdom was that the brain drove the 
bowel symptoms found in IBS. But here was 
evidence that, at least for PI-IBS, the problem 
lay in the gut itself. “IBS was supposed to be a 
condition where there weren't any changes in 
the gut — and here I was finding them,’ Spiller 
says. “It certainly went against the accepted 
dogma of the time.” 


TOXIN TROUBLE 
Other researchers linked PI-IBS not only with 
Campylobacter but also with Escherichia coli, 
Salmonella and Shigella bacteria, all of which 
produce a toxin called CdtB. Mark Pimentel, 
a gastroenterologist at Cedars-Sinai, a medical 
centre in Los Angeles, California, showed that 
CdtB cues the development of antibodies that in 
turn cause an autoimmune response. This alters 
gut motility and initiates the symptoms common 
in people with IBS. Pimentel’s work suggests that 
in up to half of these patients, CdtB also leads to 
an overgrowth of bacteria in the small intestine. 
But Giardia does not produce CdtB, 
making the situation in Bergen all the more 


mysterious. Nevertheless, the parasite 
somehow triggers the same symptoms as CdtB 
seems to, as Rortveit and her team have shown 
(K.-A. Wensaas et al. Gut 61, 214-219; 2011). 
For each confirmed giardiasis case, they iden- 
tified two uninfected local people matched by 
age and sex but otherwise randomly selected, 
and found that 46% of patients still had chronic 
bowel problems 3 years after infection, com- 
pared with just 14% in the control group. 

“It’s not clear how Giardia might trigger 
IBS,’ Pimentel says. “It throws me offa little.” 


ZOMBIE BACTERIA 

Andre Buret, a pathophysiologist at the 
University of Calgary in Canada, 
thinks he might know how 
Giardia triggers IBS. After 
hearing about what hap- 
pened in Norway, he 
decided to see whether 
pathogens such as 
Giardia alter the body's 
microbiota. He used a 
technique that he and 
his team had developed 
several years earlier that 
allows researchers to 
grow a human gut micro- 
biota outside the body and 
watch what happens. 

Gut bacteria inhabit a harsh 
environment that is subject to 
constant flushing and exposure 
to foreign substances. To survive, 
the bacteria create communities 
known as biofilms that are coated with protec- 
tive polysaccharides. Buret wondered whether 
some pathogens might modify these bacterial 
films, inducing abnormalities that persist after 
the pathogen is gone. To test this idea, he put 
some microbial samples in a dish and infected 
them with Campylobacter and Giardia. 

Buret’s team found that Giardia and 
Campylobacter disintegrate the protective bio- 
films, so the once-beneficial bacteria can move 
to other parts of the gut where they become dis- 
ruptive. Buret has also shown that these patho- 
gens can alter the gene expression of freed gut 
bacteria, making them toxic. This interaction, 
he says, “transforms them into pathobionts” — 
the bacterial equivalent of zombies. 

Like their human equivalents, zombie 
bacteria wreak havoc. In an unpublished 
study on germ-free mice, Buret found that 
these pathobionts target and kill the cells of 
the small intestine. Long after the instigating 
pathogen has gone, the actions of the pathobi- 
onts continue to cause gut inflammation. And 
with no way to get rid of the zombie menace, 
the inflammation can persist indefinitely. 


MILITARY MIGHT 

Cases of PI-IBS are particularly common 
in members of the armed forces who have 
served overseas. “Within the military, this 
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Giardia lamblia, the 
parasite responsible 
for giardiasis. 


is an underappreciated burden of disease,” 
says Mark Riddle, a vaccine researcher at 
the Naval Medical Research Center in Silver 
Spring, Maryland. 

Military service exposes people to not one 
but two major risk factors. First, many of those 
who serve overseas develop travellers’ diar- 
rhoea, which is one of the main causes of illness 
and lost duty days. And second, as the Sheffield 
researchers confirmed back in the 1990s, people 
are at greater risk of developing IBS if they have 
psychological problems — and many soldiers 
have post-traumatic stress disorder. 

This knowledge is already affecting the 

hunt for treatments for PI-IBS (see page 
S116). Spiller and his team have 
had promising results with 
the anti-inflammatory 
drug mesalazine (C. 
Lam et al. Efficacy 
Mech. Eval. http:// 
dx.doi.org/10.3310/ 

eme02020; 2015). 

They gave the drug 

to 136 people with 

IBS. It had no effect 

overall, but the 

13 people with PI-IBS 
reported less diarrhoea 
and cramping. The find- 
ing needs to be confirmed, 
Spiller says, but it supports the 
idea that the symptoms of IBS may 
arise from persistent activation of 
the immune system. 

In 2015, Pimentel’s work led 
to the approval by the US Food and Drug 
Administration of rifaximin — an antibiotic 
that was already used to treat travellers’ diar- 
rhoea — as a treatment for PI-IBS. Pimentel 
has also developed a blood test that looks for 
the presence of CdtB, and says that this could 
be an important step towards finding a cure. 

Riddle has been trying to take things one 
step further with a preventive vaccine, and has 
conducted several small-scale trials of prom- 
ising candidates. Volunteers receive either a 
placebo or a trial vaccine, and then consume a 
drink that contains a pathogen. He intends to 
test promising candidates on travellers within 
the next five years. Riddle also notes that those 
who do develop diarrhoea are immediately 
treated with antibiotics — and no one has gone 
on to develop IBS. 

Despite the apparent success of antibiotic 
treatments, many researchers are cautious 
about using them to treat PI-IBS. When the 
microbiota have already been ravaged by 
infection, is it helpful to use drugs that fur- 
ther target the fragile intestinal ecosystem? 
“Knowing how precious one’s microbiome 
is, says Spiller, “I would be very reluctant to 
disturb it” = 


Sujata Gupta is a freelance science writer 
based in Burlington, Vermont. 
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BY BRANWEN MORGAN 


fter decades without any specific drugs 
Ae irritable bowel syndrome (IBS), 

the first few years of the millennium 
brought hope in the form of two approvals. But 
this new era turned out to bea false dawn. 

In February 2000, alosetron became the first 
drug to be approved by the US Food and Drug 
Administration (FDA) for use in women who 
had IBS with diarrhoea (IBS-D). But within 
a few months there were reports of serious 
complications among people who were tak- 
ing the drug and several deaths. Alosetron was 
withdrawn that November. 

In 2002, the FDA approved tegaserod for 
IBS with constipation (IBS-C). But this drug 
also encountered problems. By 2007, it too had 
been withdrawn, this time over concerns of an 
increase in risk of cardiovascular problems. IBS 
drug development was not going well. “There 
was a general waning of enthusiasm on the part 
of pharma,’ says Alex Ford, a gastroenterologist 
at the University of Leeds, UK, “because of the 
bad experience with alosetron and tegaserod’. 

Enthusiasm was also dampened by an incom- 
plete understanding of the multiple mecha- 
nisms that drive IBS. It is not one disease, but 
rather a condition comprised ofa constellation 
of symptoms, and its diagnosis depends on the 
exclusion of other possible causes (see page 
$110). Localized gut problems include altered 
bowel function, bloating and abdominal 

pain, and for many people, anxiety and 

stress can cause these symptoms to wax and 
wane. It is difficult to stratify patients for clin- 
ical trials, to develop appropriate animal models 
or to identify genuine drug targets. 

But hundreds of millions of people — around 
11% of the global population (see “Global vari- 
ety ) — have IBS, so there is a considerable need 
for treatments. And slowly the hurdles are fall- 
ing. The FDA has approved three drugs for IBS 
within the past four years, and there is a bulg- 
ing drug pipeline ahead with numerous com- 
pounds — some of which are already approved 
for other conditions — that target all manner 


wanes of mechanisms from gut neurotransmission to 
faecal composition. 


ty 
; 2a 
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THE SEROTONIN STORY 
Because people often have either IBS-D 
or IBS-C, the treatment mainstays have his- 
torically been antidiarrhoeal agents and laxa- 
tives, which can ease some of the functional 
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A healthy pipeline 


Drugs for irritable bowel syndrome have so far been limited, 
but a promising stream of options could soon enter the market. 


issues, but do nothing to target the underlying 
mechanisms. In the 1970s, physicians began 
reporting that certain serotonin-targeting 
antidepressants were effective at slowing down 
bowel movement, even at low doses. And, as 
the evidence mounted, drug developers started 
to look to the serotonin system for ideas. 

The gastrointestinal tract produces about 
95% of the body’s serotonin; this neurotrans- 
mitter is involved in gut motility, intestinal 
secretion and visceral sensitivity — all cru- 
cial elements of IBS symptoms. “Selective 
serotonin re-uptake inhibitors and tricyclic 
antidepressants have been effective in treat- 
ing global IBS symptoms,” says Baharak 
Moshiree, a gastroenterologist at the Univer- 
sity of Miami in Florida. 

There are seven main classes of serotonin 
receptors, some of which also have subtypes 
or variants, making for a complicated pic- 
ture. The two classes most relevant to the 
gastrointestinal tract are 5-HT,; and 5-HT,. 
Alosetron is an antagonist of 5-HT,; — it 
blocks the receptor’s action and so slows the 
passage of stool through the gut. But in many 
ways it works too well. “Alosetron was a case 
of classical receptor pharmacology where you 
make a drug that binds to the one receptor 
you want to target — and it binds really tightly 
and really well,’ says neuroscientist Paul 
Bertrand at RMIT University in Melbourne, 
Australia. It binds so tightly that long-term 
use will block the receptor indefinitely. “This 
was obviously its downfall because it stayed 
there and many people became overly con- 
stipated,” Bertrand says. Alosetron was made 
available again in 2002, but only for women 
with severe, uncontrollable IBS-D, and with 
tight restrictions — which were eased earlier 
this year. 

Tegaserod is a 5-HT, receptor agonist — 
rather than blocking the receptor, the drug 
activates it and increases the signals sent to 
the circular muscles that wrap around the 
colon, speeding transit. Bertrand says that 
tegaserod was meant to be safer than alose- 
tron because it is a partial agonist, so it doesn't 
elicit a full response from the receptor. “Its 
effect on the heart was unexpected,’ he says. 
“We now know it has some affinity to other 
serotonin receptors.” 

Serotonin receptors are found throughout 
the central and peripheral nervous system, 
and so there is a risk of adverse events from 
systemic exposure to serotonergic drugs. 


SARAH J. COLEMAN 


SOURCE: C. CANAVAN ET AL. CLIN. EPIDEMIOL. 6, 71-80 (2014). 


Women seem to have more of some types of 
serotonin receptor than men, so men may 
have a different clinical response. Despite 
the early setbacks, serotonin remains one 
of the most attractive targets for IBS drug 
development. “The overactivity or underac- 
tivity of the enteric nervous system is really 
the basis of IBS as far as we understand it,” 
explains Bertrand. 

The current IBS drug pipeline includes 
several 5-HT, agonists for use in patients 
with IBS-C, but with one crucial difference 
from tegaserod. “The greater specificity of the 
newer drugs suggest they'll be much safer,” 
says gastroenterologist Michael Camilleri at 
the Mayo Clinic in Rochester, Minnesota, and 
president of the American Gastroenterologi- 
cal Association. One such agent is prucalo- 
pride, which is highly specific for the 5-HT, 
receptor and has so far not been associated 
with any heart issues. It was first approved 
in Europe in 2009 for chronic constipation 
in women, and extended to use in men in 
2015. It is available in Canada, but it is not yet 
approved in the United States. Clinical trials 
to extend its use to IBS are ongoing. 

Alternatives to alosetron are also in the 
pipeline. The anti-nausea drug ramosetron, 
which was originally approved in East Asia 
in the 1990s for use in people with cancer, 
is also a 5-HT, antagonist, but it is much 
more potent than alosetron and so can be 
given at lower doses. In 2008, its use in Japan 
was extended to men with IBS-D, and so far 
there have been no reports of severe side 
effects. Although not currently approved for 
women, ramosetron has been shown to also 
improve their IBS symptoms (S. Fukudo et al. 
Gastroenterology 150, 358-366; 2016). 


VARIED TARGETS 

Fundamentally, IBS involves a problem 
with how the bowel functions. Because of 
this, many drugs aim to treat symptoms 
by changing the local environment in the 
gut — mainly by modifying fluid levels to 
affect the speed of faecal transit. Lubipros- 
tone, which was approved by the FDA for 
IBS-C in 2008, activates chloride channels 
to increase fluid levels. Linaclotide, which 
was approved in the United States in 2012, 
targets the enzyme guanylate cyclase, and 
not only stimulates the release of ions and 
water, but also relieves pain. Diarrhoea is a 
common side effect, however. Plecanatide, 
which is still in phase III trials for IBS-C, 
activates the same receptor as linaclotide, 
but is a weaker agonist — this should reduce 
the chance of rebound diarrhoea. 

Around one-quarter of IBS-D cases are 
caused by excess bile acid, which is produced 
by the liver to help with the digestion of fats. 
Bile-acid balance can be affected by changes 
in the gut microbiota and by immune dys- 
regulation, which is why anti-inflammatory 
drugs and antibiotics, such as rifaximin 


GLOBAL VARIETY 


The global prevalence of irritable bowel syndrome 
varies hugely, which may reflect access to health 

care or stigma surrounding the condition. The 
highest estimated rates are shown, but within 
countries the reported rates vary dramatically. 
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(approved in 2015), may ease IBS symptoms. 
Problems also occur when too much bile acid 
is produced, or if too little is absorbed in the 
small intestine. Three bile-acid modulators 
are in early-stage clinical trials for IBS-D — 
colesevelam and colestipol, which absorb 
bile acid (and are already approved as choles- 
terol-lowering agents), and obeticholic acid, 
which reduces produc- 


tion of bile acidandis “The greater 
set to launch later this specificity 
year to treat an auto- of thenewer 
immune liver disease. drug's sug gest 
Bile-acid levels can also they’Il be 


be increased to help 
with IBS-C. Elobixi- 
bat has completed phase II trials for IBS-C 
in Europe and the United States; it partially 
inhibits absorption of bile acid, allowing more 
to pass into the colon. 

Opioids have long been known to affect 
stool composition; constipation is a rec- 
ognized side effect of morphine use. The 
u-opioid receptor agonist loperamide is a 
well-established over-the-counter medication 
for general diarrhoea. It works by slowing 


much safer.” 
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IRRITABLE BOWEL SYNDROME (ueyyeele) 


down bowel contractions, which allows the 
intestines more time to absorb fluid, but it can 
also cause constipation. Eluxadoline, the sec- 
ond drug approved by the FDA for IBS-D last 
year, also stimulates the p-opioid receptor, 
but at the same time dampens activity of the 
6-opioid receptor. Pamela Hornby, lead sci- 
entist on the gastrointestinal discovery team 
at the pharmaceutical company Janssen near 
Philadelphia, Pennsylvania, which developed 
eluxadoline, explains that this contrasting 
action helps to “normalize gastrointestinal 
transit without causing rebound constipa- 
tion”. Adding, “it helps with pain, too”. 


BETTER TOOLS 

One of the reasons that these locally acting 
drugs are popular with drug developers is that 
their effects are easier to see in animals. “We 
have quite a good model for motility and mus- 
cle physiology,’ says Jakub Fichna, a biochem- 
ist at the Medical University of Lddz, Poland. 
The symptoms that lack robust IBS models, 
however, are also those that have the greatest 
unmet medical need. For example, pain, which 
may or may not be associated with a change in 
bowel function, is not easy to quantify — and 
is even harder to target at a molecular level. 

In an effort to meet this need, Fichna’s group 
has adapted a mouse model of colon sensitiv- 
ity to allow the team to replicate the sensation 
and pattern of visceral pain associated with 
IBS. Fichna suspects that multiple signals are 
involved in these sensations, involving systems 
found throughout the body. “We were working 
mostly in opioids and cannabinoids, but now we 
are also looking at nociception and are slowly 
entering into the serotonin area,’ he says. “Prob- 
ably all of these are involved in visceral pain.” 
The challenge will be to establish whether it is 
possible to isolate the actions of each of these 
systems in the gastrointestinal tract and, if they 
can be, whether activation of particular recep- 
tors relieves or exacerbates IBS symptoms. 

IBS drug development has come a long 
way in the past few years. With so many treat- 
ments in the pipeline, now is an exciting time 
to be an IBS researcher, says Ford. For clini- 
cians like Moshiree, it is a relief to finally have 
more treatment options available. “Patients 
usually know within a few weeks if a drug is 
going to work,’ she says, “and if they dont, you 
need something else to put them on.” A better 
understanding of the multitude of mechanisms 
behind IBS will be pivotal, particularly in rela- 
tion to how diet, anxiety and stress influence 
symptoms. Summing up the feeling within 
the community, Fichna says that pharmacol- 
ogy is not enough when you want to combat 
IBS because there are so many other factors 
involved. “There is no single target, no single 
magic pill” He relishes the challenges ahead. 
“This is part of the adventure.” = 


Branwen Morgan is a freelance science writer 
based in Sydney, Australia. 
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COUPES IRRITABLE BOWEL SYNDROME 


There is a long list 
of unknowns about 
the causes and 
symptomatology of 


What are the 
underlying 
mechanisms of the 
syndrome? 


What is the 
relationship between 
the disorder and 
mental health? 


How important is diet 
in driving IBS? 


Can the syndrome 
be definitively 
diagnosed? 


N 


The variety of symptoms 
and of patient responses 
to treatment suggest that 


there are multiple underlying 
~ causes of IBS. Identifying 


specific mechanisms will 
help researchers to develop 
better treatments. 


For treatment purposes, 
it is important to know 
whether IBS starts in 


the gut and then 


causes psychological 
issues, or whether 
IBS symptoms have a 
neurological trigger. 


Diet seems an obvious 
starting point for 
treating IBS. Care will be 


improved with a better 
understanding of which 


subset of patients could 
benefit from dietary 
interventions and why. 


An IBS diagnosis is based 
on symptoms and on 
the elimination of other 


disorders. This is stressful 
' for patients, frustrating for 


clinicians and inefficient 
for the health-care 
system. 


S 


irritable bowel syndrome 
(IBS). But the research 


challenges can be distilled 
into four key areas. 


BY MICHELLE GRAYSON 


The gut can only react 
to Insults In certain ways 
(for example, diarrhoea, 


constipation or cramps). 
~ But these symptoms 


could be the end point 
of several different 
diseases. 


Anxiety and depression 
sometimes accompany IBS. The 
serotonin system is implicated 


in the syndrome, and plays a 
part in neurological conditions. 


IBS is more common in service 
personnel who have post- 
traumatic stress disorder. 


The fermentation of 
certain food components 
by microorganisms in 


. the gut exacerbates 
" symptoms. Diet can 


ease these, but there 
is little evidence of a 
causal link. 


Attempts to develop 
biomarkers for IBS have been 
disappointing. Breath tests and 


-. diagnostics that involve serum 
~ biomarkers are in development, 


but these have yet to translate 
to the clinic and many 
researchers remain sceptical. 


Researchers have already 
identified distinct causes 
of IBS, such as bile-acid 


> malabsorption. Large-scale 
~ initiatives such as Europe’s 


GENIEUR consortium are 
trying to tease out genetic 
and epigenetic contributions. 


Better grouping of patients 
by phenotype will help 
researchers to pick out signals 


in large-scale trials. In the 
meantime, clinicians need 


to be careful not to assume 
that patients have mainly a 
psychological problem. 


Larger, more rigorous trials 
will help to isolate the effect of 
food components. By working 


closer together, microbiologists, 
' gastroenterologists and 


nutritionists could improve 
knowledge of the behaviour of 
the gut microbiota. 


Clearer patient 
phenotypes will 
help researchers to 


identify more specific 
~ biomarkers, which can 


also be used to stratify 
patients for clinical 
testing of targeted drugs. 


Michelle Grayson is senior supplements editor at Nature. 
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DECADE IN REVIEW—FGIDS 


NEWS & VIEWS 


‘Functional’ gastrointestinal 
disorders—a paradigm shift 


Nicholas J. Talley 


In the past decade we have witnessed an explosion in the quantity and 
quality of research in the functional gastrointestinal disorders. | discuss 
10 top original research papers that, unless recent, have been highly 
cited, published in a high-impact journal and have probably shifted 


thinking in the field. 


Talley, N. J. Nat. Rev. Gastroenterol. Hepatol. 11, 649-650 (2014): published online 23 September 2014; 


doi:10.1038/nrgastro.2014.163 


Post-dysenteric IBS was described by 
Chaudhary and Truelove in 1962—the 
initial clue that IBS might be an organic 
gut disease. Although important work 
has further characterized post-infectious 
IBS, Mearin and colleagues’ prospective 
study' from 2005 identified convincingly 
an increased incidence of dyspepsia, IBS 
and an overlap syndrome (dyspepsia-IBS) 
6 and 12 months after a salmonella out- 
break; patients with more severe gastro- 
enteritis were at increased risk of developing 
one of these syndromes.' The overlap 
of functional gastrointestinal disorders 
(FGIDs) has remained mysterious but 
these key data suggest that infection might 
explain it. Salmonella spp. induces small- 
intestinal inflammation; a testable hypoth- 
esis is that perhaps the site of inflammation 
accounts for the symptom phenotype, for 
example proximal inflammation might be 
associated with dyspepsia, distal inflam- 
mation with IBS and more-extensive 
inflammation with the overlap syndrome. 


GG One could make the case it 
is now time to drop the arguably 
pejorative term functional... 9 


How might inflammation induce the 
symptoms of FGIDs? And what about 
the mysterious extraintestinal symptoms 
these patients often report? Liebregts et al.’ 
tested whether the release of proinflam- 
matory cytokines in IBS is linked to symp- 
toms and psychiatric comorbidity. They 
found that a number of cytokines were 


increased in IBS, including TNF and IL-6, 
and that lipopolysaccharide-induced TNF 
levels correlated with levels of anxiety.’ 
These data further challenge the concept 
that IBS is a functional disease (a term 
that means no organic, metabolic or bio- 
chemical abnormalities), and suggest that 
gut-to-brain pathways might be one driver 
of psychological symptoms in IBS. Could 
psychological distress in IBS be relieved 
or even cured by a gut-based intervention 
that downregulates the cytokine response? 
Meanwhile, the observation that increased 
numbers of mast cells are present in IBS 
dates back to the 1990s, but it has been 
unclear what role mast cells have in the 
generation of pain and other symptoms, 
if any. In elegant experiments utilizing 
colonic biopsy samples from patients with 
IBS, Barbara et al.,’ in 2004, observed a sub- 
stantial increase in the number of degranu- 
lating mast cells releasing histamine and, 
strikingly, mast cells observed in close prox- 
imity to nerves correlated with abdominal 
pain in IBS. The role of blocking mast cells, 
and in particular histamine, in IBS is now 
an area of active interest. 

Intense interest in the role of the gut 
microbiota in gastrointestinal and non- 
gastrointestinal disease continues, and 
the hypothesis that the colonic and small 
intestinal flora is altered in IBS is being 
actively explored. The study by Rajilic- 
Stojanovic et al.* is a technically excellent 
example of the current state of the art, dem- 
onstrating that although stool microbiota 
composition is highly individual, it can be 
used to discriminate IBS from health, with 


reduced numbers of Bifidobacteriaceae, 
Actinobacteria and Bacteroidetes but 
increased levels of Firmicutes in patients 
with IBS compared with healthy individu- 
als. Problems remain, including limited 
sampling techniques, lack of an ability to 
culture most bacteria, and the fact that 
major differences have been observed in 
different studies. Regardless, bacterial 
manipulation might be one of the keys to 
curing FGIDs. If bacteria are important in 
symptom generation in IBS—for example 
by increased gas production that distends 
the lumen and induces bloating—would 
starving them work? In their landmark 
paper from 2008, Shepherd and colleagues? 
tested dietary fructose and fructan restric- 
tion then rechallenge in patients with IBS; 
70% of patients receiving fructose, 77% 
receiving fructans and 79% receiving a 
mixture reported symptom breakthrough 
versus 14% of patients ingesting glucose. 
Other data now support the benefit of 
a low-FODMAP diet in IBS, which is 
being increasingly adopted although large 
randomized blinded trials are needed. 


4é ..a@ purely symptom-based 
classification of the FGIDs 

will be extinct in the next 

two decades... FF 


No drug has been shown to change the 
natural history of IBS (symptoms usually 
rapidly reoccur) until the past decade. If 
the gut microbiota is important in IBS, 
manipulating it should change the natural 
history of the condition at least temporarily. 
In 2011, Pimentel et al.° tested rifaximin, a 
minimally absorbed antibiotic, for 2 weeks 
in patients with diarrhoea-predominant and 
mixed IBS with 10 weeks follow-up. Overall, 
global IBS symptoms and, most notably, 
bloating improved although the therapeu- 
tic gain was modest (~10% of patients). 
The striking observation was that symptom 
improvement persisted during follow-up.° It 
is unknown why only a subgroup responded 
and whether a more targeted attack on the 
most relevant bacterial group would yield 
better results. The response is known not to 
be sustained long term and this factor is also 
unexplained; is this because of microbial or 
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Figure 1 | Functional’ gut disorders. Many 
patients have objective gut pathology and 

a gut-to-brain syndrome that explains their 
gastrointestinal as well as extraintestinal 
symptoms (e.g. anxiety, fatigue and 

sleep disturbance). 


possibly mucosal factors? Probiotics have 
also shown modest symptom benefits in IBS 
(including a Bifidobacteria infantis strain) 
but optimal probiotic therapy remains to 
be defined and benefits to date are small 
although this area has enormous potential.’ 

At last a molecular defect in IBS has 
been uncovered by Coates et al.* and these 
results, although initially controversial, 
have been confirmed. Serotonin (5-HT) is 
released from enterochromaffin cells and 
mediates peristalsis and reflex responses. In 
this important paper, defects in serotonin 
signalling (including serotonin reuptake) 
in IBS and ulcerative colitis were observed 
(although numbers of enterochromaffin 
cells were only reduced in colitis). Gut- 
specific molecular alterations in serotonin 


might explain constipation (insufficient 
5HT), diarrhoea (excess 5HT) and mixed 
IBS (SHT switching between excess and 
insufficient levels). 

Finally, two key papers indicate a para- 
digm shift in adult functional dyspepsia. 
The Kalixanda study’ was a landmark 
endoscopic population-based study from 
northern Sweden that investigated whether 
duodenal eosinophilia characterises ‘pure’ 
functional dyspepsia (no overlap of IBS 
was permitted). Patients with increased 
numbers of duodenal eosinophils had a 
nearly 12-fold increased risk of functional 
dyspepsia, and the pathology was linked 
to early satiety, a finding subsequently 
confirmed by multiple studies. A Belgian 
study'® from 2014 not only confirmed duo- 
denal eosinophilia in functional dyspepsia 
but also showed increased numbers of mast 
cells (arguably attributable to the major- 
ity of the study population having both 
dyspepsia and IBS) and more importantly 
impaired duodenal permeability. Increased 
antigen presentation (from food or bacteria) 
might be key in driving a subtle upper intes- 
tinal inflammatory response in predisposed 
patients, leading to duodenogastric reflex 
responses and early satiety, a condition we 
label as ‘functional dyspepsia. 

Our concepts have changed over the 
past decade and the short-comings of a 
symptom-based only classification of 
FGIDs are becoming increasingly apparent 
(Figure 1). IBS is ‘disappearing’ as we iden- 
tify more and more subgroups (and recog- 
nize organic diseases such as coeliac disease, 
microscopic colitis or bile-salt-induced diar- 
rhoea as clinically indistinguishable from 
IBS). Accumulating data suggest people 
might have either a gut-to-brain or brain- 
to-gut syndrome, both with identical gut 
symptoms. One could make the case it is 
now time to drop the arguably pejorative 
term functional; on the basis of the latest 
information, many patients have gut patho- 
logy. Identifying serum and stool biomarkers 


to objectively diagnose IBS and functional 
dyspepsia now seems plausible. Prediction 
is dangerous but I will stick my neck out: 
a purely symptom-based classification of 
the FGIDs will be extinct in the next two 
decades as knowledge in this area explodes. 
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PRIMEVIEW 


IRRITABLE BOWEL SYNDROME 


Irritable bowel syndrome (IBS) 

is a functional gastrointestinal 
disorder and is not associated with 
structural or biochemical abnormalities 
in the gut. Gastrointestinal symptoms 
include abdominal pain or discomfort, 


stool irregularities and bloating. 

In addition, IBS is often associated with 
somatic comorbidities (for example, 
pain syndromes, overactive bladder 
and migraine), psychiatric conditions 
(including depression and anxiety) 

and visceral sensitivity. 


EPIDEMIOLOGY 


IBS has a global estimated prevalence of ~11%, 
with rates of 5-10% in most European countries, 
the United States and China. Approximately 

2% of all patients show spontaneous disease 
remission every year. Women are more prone to 
developing IBS than men. IBS is strongly associated 
with other conditions and >20% of patients 

with IBS have other functional gastrointestinal 
disorders, including gastroesophageal reflux 


The aetiology 
of IBS is 
multifactorial 
and the precise 
molecular 
pathophysiology 
is not yet 
understood 


Enteroendocrine 


SEROTONIN 


IMMUNE 
ACTIVATION 


nature DETTE: 
REVIEWS bei 


For the Primer, visit doi:10.1038/nrdp.2016.14 


DIAGNOSIS 


IBS can be debilitating in some patients, whereas 
others might have only mild or moderate symptoms. 
IBS is diagnosed using diagnostic criteria that 

take a combination of symptoms into account, 
including abdominal pain or discomfort associated 
with chronic or recurring abnormal bowel habits. 
Other diseases (including functional gastrointestinal 
diseases, such as dyspepsia and gastroesophageal 
reflux disease) that cause typical IBS symptoms 
should be excluded. IBS can be associated with 
diarrhoea, constipation or both, and patients often 
switch between these different subtypes over time. 


MANAGEMENT 


Only half of patients with IBS seek medical 
treatment and pain is an important driver. 

A stepped-care approach to treatment is 
recommended, which includes psychotherapy. 
Medical treatment is aimed at improving bowel 
habits and/or visceral pain and consists of 
antispasmodic drugs, low-dose antidepressants, 


VISCERAL haha 
SENSITIVITY 
disease, diarrhoea, incontinence and pelvic floor (2) 


dyssynergia. In addition, IBS is linked to functional, (2) 


gut motility accelerants and antidiarrhoeals — 
depending on the symptoms. 


non-gastrointestinal syndromes such as pain 
syndromes or disorders associated with dysfunction 
of the autonomic nervous system, including 
overactive bladder, chronic fatigue syndrome and 
eating disorders. Finally, 
several psychiatric 
comorbidities 
(such as anxiety, 
depression and 
neuroticism) are 
associated with 
IBS and other 
IBS-associated 
diseases. 
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The brain—gut interaction is bidirectional. Although structural 


and functional brain abnormalities have been described 
in IBS, whether they are a cause or a consequence of 


the gastrointestinal symptoms is unknown. 


Both the gastrointestinal symptoms and the 
psychiatric comorbidities impair quality of 
life in patients with IBS, which is even lower 
than in those with gastroesophageal reflux 


disease, diabetes mellitus or severe chronic 
kidney disease. 


OUTLOOK (OO) 


Given that IBSisa 
heterogeneous condition, many drugs have failed 
in randomized controlled trials. Adequate patient 
stratification might reveal that some subgroups of 
patients benefit from available drugs, making the 
evaluation of new drugs more efficient. 
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Dietary advice 
plays an 
important part in IBS 
treatment because 
most patients have 
some form of food 
intolerance (that is, a 
physiological response to food 
allergens that is not associated with FS 
an immune response). Diets containing 
low levels of fermentable oligosaccharides, 
disaccharides, monosaccharides and polyols 
(low FODMAP diets) might improve symptoms, 


as can other dietary interventions. 
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Irritable bowel syndrome (IBS) is a heterogeneous functional disorder with a multifactorial etiology that involves the 
interplay of both host and environmental factors. Among environmental factors relevant for IBS etiology, the diet 
stands out given that the majority of IBS patients report their symptoms to be triggered by meals or specific foods. 
The diet provides substrates for microbial fermentation, and, as the composition of the intestinal microbiota is 
disturbed in IBS patients, the link between diet, microbiota composition, and microbial fermentation products might 
have an essential role in IBS etiology. In this review, we summarize current evidence regarding the impact of diet 

and the intestinal microbiota on IBS symptoms, as well as the reported interactions between diet and the microbiota 
composition. On the basis of the existing data, we suggest pathways (mechanisms) by which diet components, via the 
microbial fermentation, could trigger IBS symptoms. Finally, this review provides recommendations for future studies 
that would enable elucidation of the role of diet and microbiota and how these factors may be (inter)related in the 


pathophysiology of IBS. 
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INTRODUCTION 

With a prevalence of approximately 10-15%, irritable bowel syn- 
drome (IBS) is one of the most common gastrointestinal (GI) 
disorders in the industrialized world (1). The high prevalence 
together with the reduced quality of life and associated co-mor- 
bidities in patients suffering from IBS put a significant negative 
burden on both patients and society (2). IBS is a heterogeneous 
functional disorder and commonly subtyped according to the 
prevailing bowel habit into IBS with constipation (IBS-C), IBS 
with diarrhea (IBS-D), mixed IBS with both constipation and 
diarrhea and unsubtyped IBS with neither constipation nor diar- 
rhea using the Rome ITI criteria (3). 

Although the etiology of IBS is incompletely understood, it 
is generally regarded as a multifactorial disorder involving the 
interplay of both host and environmental factors, including diet. 
The host factors include central factors, such as aberrant stress 


responses, psychiatric co-morbidity, and cognitive dysfunctions, 
whereas intestinal functions are also involved, e.g., dysmotility, 
visceral hypersensitivity, low-grade immune activation, altered 
barrier function, and the intestinal microbiota composition (4). 

During recent years, perturbations in the intestinal microbiota 
are increasingly being linked to the pathophysiology of IBS. The 
increased risk of new onset IBS after an episode of gastroenteritis (5) 
and the association with prior antibiotic use (6) support the impor- 
tance of the intestinal microbiota in IBS. Numerous studies have 
demonstrated altered microbial profiles in (specific subgroups of) 
IBS patients compared with healthy individuals (7-11). The intesti- 
nal microbiota strongly interacts with exogenous factors, in particu- 
lar diet, which may also directly or indirectly provoke IBS symptoms. 

In this review, we describe the current evidence regarding the 
impact of diet and the intestinal microbiota and their (inter)rela- 
tion on the pathophysiology of IBS. 
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THE IMPACT OF DIET ON THE INTESTINAL 
MICROBIOTA 

The fetal intestine is thought to be sterile, although some recent 
studies indicate that colonization may already start before deliv- 
ery, by bacterial transmission through the placental barrier 
(12,13). The colonization during and upon delivery involves a 
succession of microbial populations, a seemingly chaotic process 
strongly influenced by birth mode and infant feeding. Vaginally 
born infants are initially colonized by maternal fecal and vaginal 
microbes, such as Bifidobacterium, Bacteroides, Prevotella, and 
Lactobacillus spp., whereas infants born by cesarean section are 
inoculated by typical skin bacteria and bacteria from the hospital's 
environment (14,15). 

The strong impact of diet on the indigenous microbial commu- 
nities can already be observed even before weaning. Compared 
with formula-fed infants, the microbiota of breastfed infants is 
less diverse and more dominated by Bifidobacterium spp. (16-18). 
This is generally attributed to unique bioactive compounds such as 
human milk oligosaccharides that serve as metabolic substrates for 
a limited number of microbes (19), although breast milk itself also 
harbors a transferrable microbiota (20). A rapid diversification of 
the infants’ intestinal microbiota and shift toward an adult-like 
microbiota occurs with the introduction of solid foods (21,22); yet, 
the maturation of the microbiota appears to continue in parallel 
to physiological development of, for example, the gastrointestinal 
tract and the central nervous system, throughout childhood and 
adolescence (23,24). 

Once established, the intestinal microbiota appears to be rela- 
tively stable within individuals over time (25), whereas high vari- 
ability between individuals is observed (26-29). 

Pronounced differences in the composition and functional 
capacity of the human microbiota between geographically distant 
populations point toward an important role of the diet in these 
inter-individual variations (26,30-32). The microbiota in non- 
Western populations, consuming a diet high in plant-derived 
carbohydrates, has consistently been shown to be more diverse 
and enriched in Prevotella spp. at the expense of Bacteroides spp. 
as compared with Western populations consuming a diet high in 
animal protein, sugar, starch, and fat (reviewed in (33)). It remains 
to be determined to what extent the microbiota composition is 
shaped by the diet, as the impact of genetics, ethnic, and cultural 
factors, such as hygiene and living conditions, cannot be ruled out 
in such studies comparing distinct populations. 

Nevertheless, habitual long-term dietary patterns have also 
been directly linked to intestinal microbial enterotypes. Especially, 
protein and animal fat intake has been associated with the Bacte- 
roides enterotype, whereas a high carbohydrate intake was associ- 
ated with the Prevotella enterotype (34). The effect of short-term 
dietary interventions on the microbiota composition appears to be 
only modest (34,35), unless the intervention comprises an extreme 
switch in diet (36-38). David et al. (37) demonstrated that even 
within days after the transition to a diet entirely composed of ani- 
mal or plant products, changes in microbial structure, metabolic 
activity, and gene expression can be observed. In subjects receiv- 
ing the animal-based high fat and protein diet, the abundance of 
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bile-tolerant bacteria (Alistipes, Bilophila and Bacteroides) together 
with branched short-chain fatty acids indicative of amino-acid 
fermentation increased, whereas levels of bacteria metabolizing 
dietary plant polysaccharides (Eubacterium rectale, Roseburia, 
Ruminococcus bromii) decreased. In agreement, switching to a diet 
high in resistant starch has been shown to increase Ruminococ- 
cus bromii and Eubacterium rectale related species (39), which are 
known for their saccharolytic properties. Altogether, these studies 
indicate that the intestinal microbial community structure as well 
as its function and metabolic output are influenced by our diet, 
especially in the case of clearly distinct dietary patterns. 


INFLUENCE OF DIET ON IBS 

Surveys on perceived food intolerance show that 64-89% of IBS 
patients report their symptoms to be triggered by meals or spe- 
cific foods (40-42) The majority indicate that they limit or exclude 
certain foods without professional counseling (40,42), thereby 
increasing the risk of inadequate dietary intake. Monsbakken 
et al. (40) found indications for inadequate dietary intake in 12% 
of IBS patients studied, whereas others found adequate or even 
increased intake of some nutrients (43-45). Overall, data on 
habitual dietary intake are limited and may depend on the study 
population included. Meal-related symptom aggravation is fur- 
ther supported by studies showing a postprandial worsening of 
pain using symptom diaries for 6 weeks (46) or rectal barostat 
(47,48). 

Foods often reported to provoke symptoms include wheat/ 
grains, vegetables, milk products, fatty foods, spicy foods, coffee, 
and alcohol (40-42,49) and are especially reported to be associ- 
ated with abdominal pain and gas problems (41,42). Overall, the 
percentage of responders on exclusion diets varies from 15 to 71% 
(50), but it has to be noted that most studies suffer from major 
methodological limitations and only a minority of subjects was 
found to react positively to double-blind food challenges (51-53). 

Although placebo and possibly nocebo effects have to be consid- 
ered in IBS, possible physiological mechanisms of (perceived) food 
intolerance include an exaggerated sensory and motor response 
and/or incomplete absorption, which may lead to symptoms in a 
susceptible host. High fat intake, for example, is associated with an 
exaggerated colonic motor response to eating and increased visceral 
sensitivity in IBS patients, whereas duodenal lipid infusions delay 
small intestinal transit. Randomized controlled trials adjusting fat 
intake are, however, limited (54). Poorly absorbed carbohydrates, 
like lactose, fructose, and galacto- or fructo-oligosaccharides 
(fructans), can result in luminal distension by osmotic effects and 
increased gas production due to microbial fermentation. Although 
this is a normal physiological phenomenon, it can result in symp- 
toms in subjects with an altered microbiota, increased visceral 
sensitivity, and/or abnormal gas handling. A recent study by Yang 
et al. (55) found increased mucosal mast cell numbers, serum 
tumor necrosis factor-a, rectal sensitivity, and anxiety in lactose 
intolerant IBS-D patients, supporting that neuro-immune modula- 
tion of visceral function is a potential underlying mechanism. Per- 
ceived intolerance to lactose, fructose, and fructans is frequently 
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reported and uncontrolled studies point to symptom improvement 
after removal of milk containing products or wheat from the diet 
(42). However, these studies should be interpreted with caution 
because of a large placebo and nocebo response in IBS. The overall 
evidence for an increased incidence of lactose intolerance in IBS 
patients was found to be weak and to be moderate—weak for the 
benefit of low lactose intake in these subjects (56). 

Evolving from previous studies on lactose and fructose intol- 
erance, a more generalized approach of intolerance to poorly 
absorbed and rapidly fermented carbohydrates has been intro- 
duced: FODMAPS, including fermentable oligo-, di- and mono- 
saccharides and polyols. Both retrospective and prospective open 
studies have shown reduced symptoms after introduction of a low- 
FODMAP diet in IBS patients with suspected or proven fructose 
or lactose malabsorption (57,58). These findings are confirmed by 
randomized controlled trials, showing a significant improvement 
of overall symptoms, abdominal pain and bloating, in patients 
with bloating and/or diarrhea (59) and in small (m = 15-30) unse- 
lected IBS patient groups (60,61). One study showed recurrence 
of symptoms on re-challenge with fructose and/or fructans (62). 
Involvement of gas production is supported by findings that IBS 
patients produced more hydrogen relative to healthy controls and 
lacked an increased methane production on a high vs. low-FOD- 
MAP diet (60). However, it should be noted that the long-term 
benefit of a low-FODMAP approach in large (unselected) patient 
groups is unclear, especially considering its invasiveness, which 
requires strict guidance by a dietician, and the potential risks of, 
for example, reduced fiber intake and changes in microbiota com- 
position and activity (59,63). Furthermore, randomized controlled 
trials are needed to prove whether it is superior to other dietary 
interventions (e.g., lactose or fructose reduction) or general die- 
tary advice (e.g., the NICE guidelines). Finally, the FODMAP lev- 
els vary between studies, countries, and products, which should be 
taken into account when interpreting study results and consider- 
ing low-FODMAP diets. 

Foods can also evoke symptom onset by immune activation or 
altered neuro-endocrine responses (42). Food allergy or intoler- 
ance associated with IgE-mediated immune responses is uncom- 
mon and evidence for IgG/IgG4-mediated hypersensitivity is 
inconclusive (42,64). However, in subsets of IBS patients, increased 
numbers of T-lymphocytes, mast cells, eosinophils, and/or enter- 
oendocrine cells have been found (65-67), but their exact role in 
food intolerance and symptom generation is unclear. 

Interest in the role of gluten intolerance (and the possible ben- 
efit of gluten-free diets) is increasing. A systematic meta-analysis 
found a pooled prevalence for celiac disease up to 4% among 
IBS patients (68). The prevalence of non-celiac gluten sensitivity 
among unselected IBS patients is unknown and complicated by 
clear diagnostic criteria and overlap in symptoms. Although symp- 
tom reduction after a gluten-free diet (69) and worsening of symp- 
toms after a gluten challenge (70) have been reported, the recent 
study Biesiekierski et al. (71) found no symptom induction by giv- 
ing pure gluten to IBS patients following a gluten- and FODMAP- 
free diet. These findings suggest that not gluten but fructans and/ 
or other components might contribute to symptoms in perceived 
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“wheat” intolerance. Furthermore, Carroccio et al. (72) showed 
that the majority of IBS patients diagnosed with wheat sensitivity 
had multiple food sensitivities. 

Dietary supplementation studies focus on probiotics, prebiotics, 
synbiotics, and fiber intake. A recent meta-analysis on the use of 
probiotics shows significant effects on subjective global symptom 
improvement, but study heterogeneity is statistically significant 
and no conclusions can be drawn on which individual species or 
strains are most beneficial (73). Studies on prebiotics and synbiot- 
ics are limited and insufficient to draw conclusions. Evidence for 
the benefit of dietary fiber was found for soluble fiber intake only 
(74), but it should be noted that overall study quality was moderate 
and more studies are needed on its effects in IBS subgroups. 

Although several food components are associated with symp- 
tom generation, detailed analyses on the effect of food intake on 
symptom generation and underlying mechanisms are limited. In 
addition, many studies suffer from methodological limitations, in 
part inherent to the complexity of food research. 


MICROBIOTA ALTERATIONS IN IBS 

Following the first comprehensive analysis reporting a distinctive 
intestinal microbiota composition in IBS patients (75), several 
studies have identified differences between the microbiota of IBS 
patients, or subgroups thereof, and healthy controls (reviewed 
in (7)). Recently, clinical guidance regarding the modulation of 
intestinal microbiota in IBS was provided by the Rome Team 
Working Group (76), which concluded that there is good evidence 
supporting the concept that the intestinal microbiota is perturbed 
in patients with IBS. However, despite a growing consensus 
regarding an association between the intestinal microbiota and 
IBS, results of current studies lack general consensus and a spe- 
cific microbial signature in IBS remains elusive (Figure 1). Lack 
of detailed phenotypic characterization of patients, small sam- 
ple sizes, and the cross-sectional study designs (providing only a 
single snapshot of the microbiota composition) in the majority 
of studies may all contribute to the fact that the markers of the 
IBS microbiota reported in several studies (marked in bold on 
Figure 1) are not reproducibly detected in all cohorts. Moreover, 
exogenous factors, including diet, are most often not taken into 
account, despite the demonstrated effect of diet on the microbiota 
and the potential alterations in diet associated with IBS. Finally, 
it is noteworthy that Jeffrey and colleagues identified distinct 
subsets of IBS patients, not corresponding to the traditional IBS 
subtypes, with an altered or normal-like microbiota composition 
(8), indicating that the disturbed microbiota might be relevant for 
the pathology of part of the IBS patients. Moreover, those with a 
normal-like microbiota had more adverse psychological factors, 
suggesting that central factors may predominate over microbio- 
logical factors in some but not all IBS patients. 

Several recent comprehensive studies of the microbiota in IBS 
have reported an increase in the relative abundance of Firmicutes, 
mainly Clostridium cluster XIVa and Ruminococcaceae, together 
with a reduction in the relative abundance of Bacteroidetes 
(7-9,77). Bifidobacteria have been shown to be depleted in both 
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Figure 1. Phylogenetic tree of the human intestinal microbiota with indicated microbial groups that are significantly altered in irritable bowel syndrome 
(IBS) patients relative to controls. The increase and decrease in microbial groups in IBS patients are marked by arrows facing up and down, respectively, 
and in bold if reproduced by at least two independent studies. If microbial groups are altered only in diarrhea or constipation predominant IBS patients, 
letter D or C in superscript follows the arrows, respectively. The figure is generated on the basis of the data published in (8-11,75,77-79,82,85,98,116). 


fecal and mucosal samples of IBS patients (9,78-81), which adds 
to the trend toward a modest beneficial effect of Bifidobacterium 
supplementation, in terms of the improvement of global IBS 
symptoms and pain scores as reported in a meta-analysis of pro- 
biotic trials.(73) A lower diversity and a higher instability of the 
(Firmicutes fraction or function of the) microbiota in IBS patients 
relative to controls have been reported (82-85), although these 
findings need confirmation in studies with sufficient depth of the 
microbiota analysis. 

Most if not all studies that compare microbial profiles in IBS 
patients and healthy controls are associative and cannot distinguish 
cause from consequence. However, evidence of a potential causal 
role for the GI microbiota is supported by a fecal transplantation 
experiment in which visceral hypersensitivity, presumed to under- 
pin abdominal pain in a subgroup of IBS patients, could be trans- 
ferred via the microbiota of IBS patients to previously germ-free 
rats (86). Furthermore, germ-free mice colonized with the micro- 
biota from IBS-D patients have been shown to exhibit a faster GI 
transit and impaired intestinal permeability as compared with mice 
gavaged with the microbiota from healthy humans (87). The mech- 
anisms through which the microbiota exerts these effects are not 
fully understood, but the fact that supernatants from colon biopsies 
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of IBS patients, with increased levels of histamine and proteases, 
excite human submucosal neurons could indicate a brain-gut axis 
connection (88). Furthermore, an increase in fecal serine proteases 
has been linked with symptom development in IBS patients (89,90). 
Although the origin of both histamine and proteases could be both 
human and microbial, it is noteworthy that some of the Firmi- 
cutes bacteria that were found to be increased in abundance in IBS 
patients (8,9) are known to secrete large amounts of extracellular 
proteases (91). Alternatively, the predominantly endogenous fecal 
proteases might not be degraded by microbes because of the accel- 
erated transit (as seen in IBS-D patients) or because of the disturbed 
metabolic activity of the microbiota (92). Serine protease inhibitors, 
the antagonists of serine proteases, are produced by many bacteria, 
including bifidobacteria (93), which may contribute to the possible 
benefit of bifidobacterial supplementation. 

Dysregulated intestinal immune function, chronic low-grade 
mucosal inflammation, and increased mucosal permeability and 
barrier dysfunction have all been suggested as putative patho- 
genic mechanisms in IBS, in which the intestinal microbiota 
might have a role (reviewed in (76,94)). Moreover, the bidirec- 
tional interactions between the intestines and the central nervous 
system, which have an important role in the pathogenesis of IBS, 
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have suggested to be modulated by the microbiota (reviewed in 
(95)). Yet, there is a lack of data on the exact mechanisms through 
which the host-microbiota interactions underlie pathophysiology 
and generate symptoms. Identifying these mechanisms is further 
complicated by the fact that the majority of GI microbes remain 
uncharacterized (96). Indeed, most putative microbial markers 
of IBS are among uncultured bacteria. For example, uncultured 
bacteria related to Ruminococcus torques are significantly enriched 
in IBS patients in several cohorts (9,11,75), and their abundance 
positively correlates with bowel symptoms (9,10,97). It is intrigu- 
ing that this bacterial group has been shown to be suppressed 
following a multispecies probiotic treatment that alleviated 
IBS symptoms (98). Similarly, uncultured bacteria within the 
Clostridiales order are reproducibly detected in significantly 
depleted abundance in IBS (9,10) but also in ulcerative colitis (99). 
Focus on the function of these uncultured bacteria should be of 
major interest for future studies. 


MICROBIOTA-DIET INTERACTIONS IN IBS 

Intestinal microbes have an important role in the digestion of 
dietary components, resulting in metabolites that may directly or 
indirectly contribute to IBS symptoms (Figure 2). 

Colonic fermentation of carbohydrates mainly results in short- 
chain fatty acids (SCFAs, including acetate, propionate, and 
butyrate) and variable amounts of hydrogen and carbon dioxide. 
Several low-abundant microbial groups can subsequently dis- 
pose the produced hydrogen into acetate, hydrogen sulfide, and 
methane. The fermentation of protein residues, primarily in the 
distal colon, produces a variety of metabolites, including ammo- 
nia, organic acids, heterocyclic amides, and phenolic and indolic 
compounds, that are toxic and detrimental to gut health (100). 
Moreover, fat and digested proteins increase the excretion of bile 
acids that are subject to extensive bacterial transformation in the 
intestinal tract. This could be of particular relevance as bile acids 
inhibit many bacteria and may modulate microbiota composition 
and also have the potential to affect major pathophysiological fac- 
tors in IBS, including GI motility, secretion, and immune function 
(101,102). Indeed, in a recent large-scale study, increased colonic 
bile acid exposure was demonstrated in a subset of, predominantly 
non-constipated, IBS patients correlating with diarrhea and accel- 
erated colonic transit (101). 

An association between the intestinal microbiota and pro- 
tein metabolism has been established by a significant correlation 
between the abundance of several post-infectious (PI)-IBS micro- 
bial markers and host amino-acid metabolism (10). Among harm- 
ful products of protein fermentation (100,103), hydrogen sulfide 
might be relevant for compromising intestinal health as it directly 
impairs epithelial metabolism (104) and acts as a gut-brain sign- 
aling molecule (105). Hydrogen sulfide can be converted to thio- 
sulfate and further oxidized to tetrathionate during inflammation. 
The latter supports the growth of Salmonella (106) and other 
tetrathionate utilizing pathogens from the Gamma-Proteobacteria 
class (107), many of which have been associated with bowel symp- 
toms of IBS patients (9,10,82). 
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FODMAPS, indigestible carbohydrates, can be fermented by 
intestinal microbes, resulting in increased gas production and an 
osmotic effect that can provoke IBS symptoms in a sensitive host 
(108), although similar changes may go unperceived in normal 
subjects (109). Carbohydrate utilizing GI bacteria such as Dorea 
spp., major gas producing bacteria in the human intestine (110), 
were reported to be significantly increased in abundance in IBS 
patients (9,11). Overproduction of gas, specifically hydrogen, has 
been associated with IBS symptoms (111), especially abdomi- 
nal pain and flatulence. In patients with IBS-D, whose colon was 
found to be less able to accommodate the increased intestinal vol- 
ume (112), overproduction of gas could be a trigger for increased 
colonic wall tension. In general, hydrogen produced in the gut is 
most efficiently removed by the methanogenic archaea (113) that 
appear to be depleted in IBS-D patients (9,10) and increased in 
IBS with constipation patients (reviewed in (114)) Although there 
is substantial evidence for implication of methane in IBS sympto- 
mology, most data are based upon methane breath testing. Hence, 
future studies, especially those addressing the diet-microbiota 
connection in IBS, should quantify methanogens to provide more 
direct evidence. 

Alternative pathway for gas elimination is the conversion into 
acetate by acetogenic bacteria, Blautia spp. (115). Blautia spp. are 
highly prevalent and dominant intestinal bacteria, which have 
been found to be elevated in IBS patients (9,79). Intriguingly, the 
major fermentation end-product—acetate—has been reported as 
significantly increased in IBS patients (116), although this was 
not confirmed in another study (117). This may relate to the fact 
that most studies measure stool concentrations without taking the 
total stool volume into account, thereby potentially concealing 
overall increased acetate production as a result of dilution due to 
increased stool volume. 

SCFAs produced upon the microbial fermentation of undigested 
carbohydrates and reduced levels thereof may also contribute to 
symptom generation in IBS. SCFAs have some documented health 
benefits, in particular butyrate, which is an important energy source 
for colonocytes, and among others inhibits inflammation and 
enhances barrier function (118). Moreover, intraluminal adminis- 
tration of butyrate into the distal colon has been shown to decrease 
visceral sensitivity in healthy humans (119). However, excessive 
intake of fermentable carbohydrates and high SCFA levels can also 
increase the osmotic load and thereby lead to diarrhea (120). 

Data on SCFAs in IBS are limited and inconsistent, showing no 
alterations, increased or decreased levels of SCFAs relative to con- 
trols (61,116,117,121). Furthermore, altered SCFAs profiles might 
merely reflect diet rather than a feature of the condition. Given 
that the major products of FODMAPs fermentation are SCFAs, 
and that some major microbial SCFA-producers may be altered in 
IBS, the connection between FODMAPs, SCFAs, microbiota, and 
IBS symptoms calls for further elucidation. 

Few studies have examined the impact of dietary interventions, 
in particular FODMAP restriction, on the microbiota in IBS 
patients. A low-FODMAP diet has been linked to reduced bifido- 
bacterial counts (59), which seems a paradox given their potential 
health benefit. A recent study reported a reduction in total bac- 
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Figure 2. Non-exclusive listing of dietary metabolites that can be regulated by microbial activity and contribute to irritable bowel syndrome (IBS) symp- 
toms. The impact of metabolites affecting different domains of intestinal health is depicted; however, it should be noted that the effects of some metabo- 
lites depend on their concentration or further conversion. Note: for immune modulation and barrier function, metabolites with positive and negative effects 
are colored in green and red, respectively. SCFAs, short-chain fatty acids; BAs, bile acids. 


terial abundance following the introduction of a low-FODMAP 
diet, as well as a reduction in absolute counts but not relative 
abundance of specific bacterial groups, including bifidobacte- 
ria as compared with habitual diet (63). However, as compared 
with a diet high in FODMAP content, the low-FODMAP diet 
appeared to be associated with marked lower relative abundances 
of butyrate-producing bacteria and A. muciniphila, and a signifi- 
cantly higher abundance of R. torques, adding more paradoxical 
data on the effects of low-FODMAP diet (63). 
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Extensive analyses of microbiota composition, functionality, 
and fermentation products in relation to FODMAP restriction and 
symptom generation are currently, however, lacking. 

The direct impact of diet-microbiota interactions on IBS symp- 
toms has been demonstrated in an elegant study on the impact 
of a diet rich in fermentable substrates. The introduction of this 
flatulogenic diet significantly increased the gas volume and 
the number of gas evacuations, as well as abdominal symptoms 
and digestive discomfort in both healthy subjects and flatulent 
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patients (122). Yet, upon dietary challenge, patients’ microbiota 
developed instability in composition, exhibiting variations in the 
Bacteroidetes and Firmicutes phyla and reduction in microbial 
diversity, whereas healthy subjects’ microbiota remained stable. 
Hence, the extent of dysbiosis in IBS patients likely depends on 
their diet. In a small study among children with IBS, the intro- 
duction of a low fermentable substrate diet significantly decreased 
abdominal pain severity and frequency in some but not all 
children. Responders to this dietary intervention appeared to differ 
in intestinal microbiota composition at baseline from nonrespond- 
ers, indicating that the efficacy of dietary interventions might be 
influenced by the patient’s microbiota (123). To prove the validity, 
these pilot results will have to be replicated in larger studies. 


CONCLUSION, RECOMMENDATIONS FOR FUTURE 
RESEARCH 

Both experimental and observational studies provide good evi- 
dence to conclude that microbial alterations frequently observed 
in IBS patients potentially drive or perpetuate gastrointestinal 
symptoms. Yet, the strong interaction between distinct dietary 
patterns and the intestinal microbial communities is likely one 
of the explanations for the inconclusive findings among stud- 
ies comparing the microbiota composition in IBS patients and 
healthy subjects. 

More well-designed studies on the effect of food intake on symp- 
tom generation and the underlying mechanisms are warranted. 
The potential benefit of FODMAP restriction seems promising 
but requires further study and dietary guidance is essential given 
the complexity and risk of inadequate nutrient intake. In addi- 
tion, more evidence on long-term outcome and well-performed 
challenge or re-introduction trials are warranted as important 
adaptions may occur. The incorporation of microbiota characteri- 
zation in such studies would both provide insight into the impact 
of these dietary interventions on the microbiota, as well as offer 
the opportunity to examine whether the microbiota can predict 
response. 

Given that IBS patients comprise a heterogeneous group both 
in the type of bowel symptoms and food components to which 
they might respond, and that the microbiota is extremely com- 
plex and variable between individuals, it is clear that defining the 
correlations between IBS, microbiota, and diet is not an easy task. 
However, high-throughput molecular methods enable reproduc- 
ible and in-depth analysis of the intestinal microbiota, providing 
the necessary tool for testing these correlations. The major chal- 
lenge remains to include the conscientious collection of dietary 
information in future studies in this field. Moreover, future stud- 
ies should include multiple microbiota analysis at various time 
points in a large number of well-phenotyped patients to increase 
our insight in the contribution of intestinal microbiota pertur- 
bations to the diverse and fluctuating symptom pattern in IBS 
patients. 
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Dysbiosis leading to abnormal intestinal fermentation has been suggested as a possible etiological 


mechanism in irritable bowel syndrome (IBS). We aimed to investigate the location and magnitude of 


intraluminal pH and fecal short-chain fatty acids (SCFAs). Intraluminal pH and intestinal transit 
times were measured in the small and large bowel using a wireless motility capsule (SmartPill) in 
47 |IBS and 10 HC. Fecal SCFAs including acetate, propionate, butyrate, and lactate were analyzed 
by capillary gas chromatography in all enrolled subjects. Correlations between intestinal pH, fecal 


OBJECTIVES: 
altered intestinal bacterial fermentation in IBS and its clinical subtypes. 

METHODS: IBS patients who satisfied the Rome III criteria (114) and 33 healthy controls (HC) were 
investigated. Intestinal fermentation was assessed using two surrogate measures: intestinal 
SCFAs, intestinal transit time, and IBS symptom scores were analyzed. 

RESULTS: 


Colonic intraluminal pH levels were significantly lower in IBS patients compared with HC (total 


colonic pH, 6.8 for IBS vs. 7.3 for HC, P=0.042). There were no differences in total and segmental 
DH levels in the small bowel between IBS patients and HC (6.8 vs. 6.8, P=not significant). The 
intraluminal colonic pH differences were consistent in all IBS subtypes. Total SCFA level was 
significantly lower in C-IBS patients than in D-IBS and M-IBS patients and HC. The total SCFA level 
in all IBS subjects was similar with that of HC. Colonic pH levels correlated positively with colon 
transit time (CTT) and IBS symptoms severity. Total fecal SCFAs levels correlated negatively with CTT 


and positively with stool frequency. 


CONCLUSIONS: Colonic intraluminal pH is decreased, suggesting higher colonic fermentation, in IBS patients 
compared with HC. Fecal SCFAs are not a sensitive marker to estimate intraluminal bacterial 


fermentation. 
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INTRODUCTION 

Irritable bowel syndrome (IBS) is characterized by chronic or 
recurring abdominal pain or discomfort associated with altered 
bowel habits and is one of the most common gastrointestinal (GI) 
disorders (1,2). IBS is considered a multifactorial disorder associ- 
ated with visceral hypersensitivity, altered gut motility, and dys- 
function of the brain-gut axis and immune system. However, the 
pathophysiology of the disorder is still not completely understood 
(2-5). Recently, it has been suggested that alterations in the gut 


microbiota, leading to abnormal intestinal fermentation, may be a 
possible etiological mechanism (6-9). 

Intestinal fermentation by gut microbiota is a central physi- 
ological process by which polymers, including carbohydrates, 
are biotransformed into end-products, mainly short-chain fatty 
acids (SCFAs) and gases. These intraluminal fermentation pro- 
cesses have an important role in supplying nutrients and energy 
to the host (10). However, alterations in intestinal fermentation 
may lead to certain physiological abnormalities that are often 
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observed in IBS such as intraluminal excessive gas production 
and altered motility. These alterations in intestinal physiology 
can be an important factor in provoking or exacerbating IBS 
symptoms (11,12). In addition, interventional clinical studies 
targeting the intestinal microbiota with antibiotics or probiotics 
demonstrated beneficial effects in some patients with IBS (13,14). 
Specifically, in regard to intestinal fermentation, recent studies 
have shown that a diet low in FODMAPs (fermentable oligo- 
saccharides, disaccharides, monosaccharides, and polyols) can 
offer considerable symptom relief in patients with IBS (15,16). 
However, the role of altered intestinal fermentation by intestinal 
bacteria in the pathogenesis of IBS has not been adequately inves- 
tigated in this condition. 

We hypothesized that bacterial fermentation is altered in cer- 
tain segments of the GI tract in IBS and that these alterations are 
associated with bowel functions and symptoms. The aims of the 
current study were to investigate the location and magnitude of 
altered intestinal bacterial fermentation in IBS and its clinical sub- 
types and to examine their relation to bowel characteristics and 
GI symptoms. Intestinal fermentation was assessed by measuring 
intraluminal pH levels (by a wireless motility capsule (SmartPill; 
Given Imaging, Yoqneam, Israel)) and fecal SCFAs concentration 
as surrogate markers in a cohort of well-characterized patients 
with IBS and healthy controls (HC). 


METHODS 

Study population 

All subjects were recruited from the University of North Caro- 
lina (UNC) healthcare outpatient clinics and from the Chapel Hill 
general population by advertising. Inclusion criteria included age 
of 18 years or older, any gender, race, or ethnicity. HCs had no 
chronic or recurring GI symptoms. IBS patients met the Rome 
III criteria for IBS and had mild-to-moderate symptom severity 
with the IBS-symptom severity scale (IBS-SSS) score of 175-300. 
Additional inclusion criteria included normal physical examina- 
tion and laboratory tests and a normal colonoscopy within the 
last 3 years, in patients aged 50 years or older. Subjects with a his- 
tory of GI tract surgery (other than appendectomy or cholecys- 
tectomy), a history of inflammatory bowel diseases, celiac disease, 
lactose malabsorption, or any other diagnosis that could explain 
their chronic or recurrent bowel symptoms were excluded from 
the study. In addition, participants were excluded if they had a 
history of treatment with antibiotics, anti-inflammatory agents— 
including aspirin, non-steroidal anti-inflammatory drugs or ster- 
oids, or intentional consumption of probiotics in the past 6 weeks. 
All subjects were evaluated by a physician to exclude an alterna- 
tive diagnosis to IBS. The study was approved by the UNC inter- 
nal review board, and all subjects provided written consent before 
participation and were compensated for participating in the study. 


Clinical symptom assessments in IBS 


Abdominal pain/discomfort, altered bowel habit (loose/watery 
or hard/lumpy stools), and bloating were assessed using a 10-cm 
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visual analogue scale (maximum score, 10) (17). IBS-SSS (18) and 
IBS-specific quality-of-life (IBS-QOL) (19) were also assessed. 
Stool frequency was measured by the number of bowel move- 
ments per day and stool consistency determined using the Bristol 
Stool Scale (20). 


Assessment for intraluminal fermentation and intestinal transit 
Intestinal fermentation was assessed using two surrogate mark- 
ers: intestinal intraluminal pH and fecal SCFAs. Intraluminal pH 
levels were measured using a wireless motility capsule (Smart- 
Pill). The wireless motility capsule is an FDA approved device 
that is clinically used to assess intestinal transit. After being 
swallowed, the capsule constantly measures intraluminal pH, 
pressure, and temperature as it travels through the GI tract and 
transmits this information wirelessly to a portable receiver worn 
by the examinee (21). Using these parameters and a specific soft- 
ware, the (SmartPill) system generates data on gastric emptying 
time, small bowel transit time (SBTT), colon transit time (CTT), 
and whole-gut transit time (WGTT) (21,22). For the purpose of 
this study, we used the intraluminal pH data as surrogate mark- 
ers for intestinal fermentation. To enable further accuracy in 
assessment of localization of intestinal fermentation, the small 
and large bowel pH measurements were both divided into four 
quartiles (Q1-4) based on the total time in each (small and large 
bowel) segment. SBTT, CTT, and WGTT were also assessed as 
previously described (21,22). 


Analysis of SCFA 

Fresh stool samples were collected from all subjects on site or 
at home in the morning of the study visit. Each fecal sample 
was immediately transferred to the laboratory where it was 
homogenized and stored at —80°C until analyzed. Fecal acetate, 
propionate, butyrate, lactate, and total SCFAs were quantified in 
fecal samples in duplicate using capillary gas chromatography 
with 2-ethyl butyric acid as an internal standard, as described 
previously (10,23-25). 


Statistical analysis 

We used Student's t-test when there were two groups being 
compared and analysis of variance when there were more 
than two groups being compared for continuous variables. The 
x test was used for discrete variables. For post hoc analysis, 
Tukey’s honest significance test was used. Baseline charac- 
teristics were evaluated by Student's t-test or Pearson’s 7’ 
test. Intestinal pH, fecal SCFA levels, and GI transit times 
were compared between groups by Student’s t-test or analysis 
of variance. Correlations between colonic pH, fecal SCFAs, 
CTT, stool frequency and consistency, IBS-SSS, and IBS-QOL 
were performed using Spearman’s correlation analysis for all 
study subjects including IBS and HC. All P-values were two- 
tailed, and P-values<0.05 were considered statistically signifi- 
cant. Statistical analyses were performed using SPSS version 
18.0 for Windows (SPSS, Chicago, IL, USA). Unless otherwise 
stated, data are presented as mean+s.d. 
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Table 1. Baseline characteristics of the study groups 


IBS HC P value 

Number of subjects 114 Se) 
Age (years), mean 35.4411.3 Se See lsa0) 0.534 
Sex (male:female) 16:98 4:29 O778 
Body mass index (kg/m?), 27.1+5.5 BU aes) 2 0.929 
mean 
Race 0.054 

Caucasian 84 (73.7%) 20 (60.6%) 

Black 24 (21.1%) TAN 276) 

Others 6 (5.3%) 6 (18.2%) 
IBS subtype (number) 

Constipation-predominant 26 (22.8%) 


Diarrhea-predominant 42 (36.8%) 
Mixed S2 (Zoe) 


Unspecified 14 (12.3%) 


HC, healthy control; IBS, irritable bowel syndrome. 
The data are presented as mean+s.d. 


RESULTS 

Study population 

We investigated a total of 147 subjects (114 IBS and 33 HC). The 
study IBS population included patients with constipation-pre- 
dominant IBS (C-IBS, n=26), diarrhea-predominant IBS (D-IBS, 
n=42), mixed type IBS (M-IBS, n=32), and unspecified type IBS 
(n=14). The racial distribution was as follows: Caucasian 70.7%, 
Black 21.1%, and other 8.2%. There were no significant differences 


in age, sex, body mass index, and race between the study groups 
(Table 1). 


Intestinal intraluminal pH 

Intestinal intraluminal pH data were collected from 47 patients 
with IBS and 10 HCs. Mean total colonic pH levels were signifi- 
cantly lower in the IBS group compared with HC (6.8 vs. 7.3, 
P=0.042). The significant differences were also observed in Q1 
(6.4 vs. 6.8, P=0.011) and Q4 (7.2 vs. 7.8, P=0.046). The colonic pH 
levels at the Q1 were the lowest among the four quartiles in both 
groups (Table 2 and Figure 1). The lower intraluminal colonic 
pH levels compared with HC were consistent across all IBS sub- 
types (mean total colonic pH: 7.0 for C-IBS, 7.0 for D-IBS, 6.8 for 
M-IBS, 6.4 for unspecified type IBS, and 7.3 for HC, respectively; 
Table 3). 

There were no intraluminal pH differences in the four quartiles 
and total small bowel between IBS and HC (mean total small bowel 
pH: 6.8 for IBS vs. 6.8 for HC, P=0.937; Table 2). Also, there were 
no significant intraluminal small bowel pH differences between 
the groups based on IBS subtypes. 

In subgroup analyses, small and large bowel pH levels were not 
different according to gender, race, and age (<45 vs. >45). 
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Table 2. Small and large bowel intraluminal pH levels 


IBS (n=47) HC (n=10) P value 
Small bowel pH 
Q1 small bowel pH ey lies 0)7/ 23610) 0.885 
Q2 small bowel pH 67205 6.7+0.4 0.840 
Q3 small bowel pH 7.1+0.4 7.1+0.4 0.994 
Q4 small bowel pH 7.3404 75205 0.878 
Total small bowel pH 6.8+0.4 6.8+0.4 0.937 
Colonic pH 
Q1 colonic pH 6.4+0.6 6.8+0.4 0.011 
Q2 colonic pH be 207 Ji lles0)5 O73 
Q3 colonic pH 7,040.8 7.5+0.6 0.081 
Q4 colonic pH 7.22018 7.8+0.8 0.046 
Total colonic pH 6.8+0.7 7320.5 0.042 


HC, healthy control; IBS, irritable bowel syndrome; Q, quartile. 
The data are presented as mean+s.d. 


. *P.<0.05 
i IBS (n=47) ‘ 


CHC (n=10) 


* 


1 


Quartile 1 Quartile 2 Quartile 3 Quartile 4 Total 


Figure 1. Colonic pH levels. Quartile 1 and 4 and mean total colonic pH 
levels were significantly lower in irritable bowel syndrome (IBS; n=47) 
compared with healthy control (HC; n=10). The pH in the proximal colon 
(quartile 1) was the lowest in both groups. *P<0.05. 


Fecal SCFAs 

Fecal SCFAs were measured in all enrolled subjects (114 
IBS and 33 HC). Mean acetate, propionate, butyrate, lactate, 
and total fecal SCFAs levels were not significantly differ- 
ent between all IBS and HC (total SCFAs level: 92.1 mm vs. 
92.0 mM, P=0.996, respectively). Acetate had the highest levels 
followed by propionate and butyrate in both IBS and HC. 
Lactate was not detected, except for two cases (6.4mm in a 
M-IBS subject and 8.1mm in a D-IBS subject). The SCFAs 
profiles were similar across the different IBS subtypes (Table 4 
and Figure 2). However, acetate, propionate, butyrate, and 
total SCFAs levels were lower in C-IBS than in all the other 
subtypes and were significantly lower compared with D-IBS. 
The propionate level in C-IBS was significantly lower than in 
any other group. 
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Table 3. Large bowel intraluminal pH levels according to IBS subtype 


C-IBS (n=13) D-IBS (n=14) 
Q1 colonic pH 6.4+015 Ses 
Q2 colonic pH 7.009 6.9+0.8 
Q3 colonic pH 7.1+0.8 7.0+0.6 
Q4 colonic pH Heses0s) Hi oas0) 7 
Total colonic pH 7 O07 7.0206 


M-IBS (n=12) U-IBS (n=8) HC (n=10) 
6.4+0.5 5 OES: 6.8+0.4 
(Sy Sea 0KS) 6.5+0.4 PLO. 
Po llesOhs G720.e Tee Oe) 
TEMA) 7 O£07 7 820.8 
6.8+0.7 6.4+0.6° VSt0-5 


C-IBS, constipation-predominant IBS; D-IBS, diarrhea-predominant IBS; HC, healthy control; IBS, irritable bowel syndrome; M-IBS, mixed type IBS; Q, quartile; U-IBS, 


unspecified type IBS. 

The data are presented as meanss.d. 

8P<0.05 compared with Q1 colonic pHs of D-IBS and HC. 
>’P<0.05 compared with total colonic pH of HC. 


Table 4. Fecal SCFAs levels 


C-IBS (n=26) D-IBS (n=42) M-IBS (n=32) 
Acetate (mm) 47.8+23.4° (Sill ye ie Ze SS ae 2 
Propionate (mm) ING He pee ys) 20:89 18.4+10.2 
Butyrate (mm) e285: 16.4+8.9 TSSLO5 
Total SCFAs (mm) 13 0239.) Sis allaseie),7/ Se Vesey) (0) 


U-IBS (n=14) IBS total (n=114) HC (n=33) P value? 
63.9+40.9 58.0425.8 5 (ese ex's) OMe 
20.4+14.5 18.249.8 20,043,0 0331 
2) S222 83 1525121 | 15.4+8.9 0.875 
MOS Shs 110 92.1+44. 1 OZ 02333 0.996 


C-IBS, constipation-predominant IBS; D-IBS, diarrhea-predominant IBS; HC, healthy control; IBS, irritable bowel syndrome; M-IBS, mixed type IBS; SCFA, short-chain 


fatty acid; U-IBS, unspecified type IBS. 

The data are presented as meanss.d. 

4P-value for comparison between total IBS and HC. 

>P<0.05 compared with the acetate level of D-IBS. 

°P<0.05 compared with propionate levels of all the other groups. 
¢P<0.05 compared with the butyrate level of D-IBS. 

®P<0.05 compared with total SCFA levels of D-IBS, M-IBS, and HC. 


200 C-IBS (n=26) 
D-IBS (n=42) * 
[1 M-IBS (n=32) . * 
10 Fi U-IBS (n=14) ne 


All IBS (n=114) 


5 * [HC (n=33) 


(mM) 


80 * 


* 


j 


Butyrate 


40 


Acetate Propionate Total SCFAs 


Figure 2. Fecal short-chain fatty acids (SCFAs) levels by irritable bowel 
syndrome (IBS) subtypes. Total SCFA level was significantly lower in 
constipation-predominant irritable bowel syndrome (C-IBS) than in 
diarrhea-predominant IBS (D-IBS), mixed type IBS (M-IBS), and healthy 
control (HC). Total SCFA level averaged across all IBS was similar with that 
in HC. Propionate level in C-IBS was significantly lower than that of the 
other groups. Acetate and butyrate levels were significantly lower in C-IBS 
than in D-IBS. *P<0.05. U-IBS, unspecified type IBS. 


Intestinal transit time 
Intestinal transit time data were collected from 47 patients with IBS 
and 10 HCs. Overall the average intestinal transit times including 
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SBTT, CTT, and WGTT were not significantly different between IBS 
patients and HC. However, as expected, CTT and WGTT were sig- 
nificantly longer in C-IBS compared with those in D-IBS and M-IBS. 
There were no significant differences in SBTT between the groups. 
The whole-gut and segmental transit times are presented in Table 5. 


Correlation between colonic pH, fecal SCFAs, CTT, and 
symptoms 

Colonic pH levels positively correlated with CTT (Spearman’s 
correlation coefficient, SCC=0.33, P=0.013) and had a negative 
correlation trend with fecal SCFA levels (SCC=—0.23, P=0.079). 
Acetate, propionate, butyrate, and total SCFA levels correlated 
negatively with CTT and positively with stool frequency (total 
SCFAs vs. CTT, SCC=—0.44, P=0.001; total SCFAs vs. stool fre- 
quency, SCC=0.45, P<0.001). 

Colonic pH levels positively correlated with IBS-SSS (SCC=0.49, 
P=0.014), and similar positive trends were noted for abdominal 
pain and bloating with negative trends for IBS-QOL. CTT had cor- 
relation trend positively with IBS symptom scores and negatively 
with IBS-QOL. Fecal SCFA levels had no correlation with IBS 
symptom scores and QOL. The results of Spearman's correlation 
analyses are presented in Table 6. 
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Table 5. Gastrointestinal transit times according to IBS subtype 


C-IBS (n=13) D-IBS (n=14) M-IBS (n=12) 


WGTT (hours) S77 S)° 30.1+18.4 30.1+14.4 
SBTT (hours) 4.4+2.1 4.1+1.1 Sele leZ 
CTT (hours) 44.6+24.7° 22.4+16.0 23.1414.7 


Altered Colonic Bacterial Fermentation in IBS 


U-IBS (n=8) IBS total (n=47) HC (n=10) P value? 

45.3+25.2 39.6+24.0 48.2+31.9 0.344 
Sor 4.0+1.4 Eepell 0.348 

39.1+24.6 Sl MeeZ NS 34.7+21.1 0.632 


C-IBS, constipation-predominant IBS; CTT, colon transit time; D-IBS, diarrhea-predominant IBS; HC, healthy control; IBS, irritable bowel syndrome; M-IBS, mixed type 
IBS; SBTT, small bowel transit time; U-IBS, unspecified type IBS; WGTT, whole-gut transit time. 


@P-value for comparison between total IBS and HC. 
>’P<0.05 compared with WGTTs of D-IBS and M-IBS. 
¢P<0.05 compared with CTTs of D-IBS and M-IBS. 


Table 6. Spearman’s correlation coefficient in correlation analyses between variables 


Stool Abdominal Bloating IBS-SSS IBS-QOL 
frequency pain 

-0.10 0.29 0.29 0.492 -0.35 

0.45? -0.14 0.01 -0.33 On2 

—0.40 0.41 0.30 0.39 -0.28 


Total Total CTT Stool 
colon pH SCFAs consistency 

Total colon pH 1.00 —0.23 Oa: —0.30 
Total SCFAs —0.23 1.00 —0.44° —0.01 
iil O32 —0.44° 1.00 —0.64 
CTT, colon transit time; IBS-SSS, IBS-symptom severity scale score; IBS-QOL, IBS-quality-of-life; SCFAs, short-chain fatty acids. 
aPa0l05, 
a0 Oln 


HC 


carbonydrates 


IBS 


Figure 3. Hypothesized model. Colonic intraluminal pH is decreased, and short-chain fatty acid (SCFA) production is increased, suggesting higher fermen- 
tation in patients with irritable bowel syndrome (IBS) compared with healthy control (HC). SCFAs are effectively absorbed (>95% of SCFAs) in the gastroin- 
testinal (GI) tract, and only a small portion (~5% of total SCFAs) is excreted in feces. Both processes can affect and are affected by intestinal transit. 


DISCUSSION 

IBS is a common condition that affects nearly 10% of the gen- 
eral population (1,2) and contributes to a significant decrease in 
QOL along with an increase in health-care utilization. In spite 
of its high prevalence and impact on QOL, the pathophysio- 
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logical mechanisms underlying the disease remain unknown. 
Recently, the possible role of the intestinal microbiota in the 
pathogenesis of IBS has gained much interest (4,8,26). In this 
study, we investigated the role of intestinal bacterial fermenta- 
tion in IBS. 
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Intraluminal intestinal fermentation by colonic bacteria pro- 
duces gases such as hydrogen and carbon dioxide and SCFAs as 
secondary by-products (10,27,28). The predominant SCFAs pro- 
duced by bacterial fermentation are acetate, butyrate, and propi- 
onate. Lactate, which is a precursor for propionate and butyrate, is 
itself a by-product of fermentation. SCFAs are naturally acidic and 
cause a measurable drop in the pH of the intestinal lumen (28-30). 
In this study, we have measured intraluminal pH levels and fecal 
SCFA concentrations as surrogate markers for intraluminal intes- 
tinal fermentation and used these factors to assess the location and 
magnitude of intestinal bacterial fermentation in a well-character- 
ized cohort of patients with IBS and HC. 

We found that the mean total colonic pH levels were significantly 
lower in IBS compared with HC, potentially indicating higher intra- 
luminal bacterial fermentation in the IBS group. Of great impor- 
tance is our finding of consistently higher bacterial fermentation in 
the large bowel of patients with IBS compared with HC, whereas 
there were no differences in the pH levels in the small bowel between 
those two groups. Notably, these findings were consistent in all small 
bowel and large bowel segments and across all IBS subtypes. 

The intraluminal pH levels were lower at the proximal colon 
than the distal colon in both IBS and HC groups when we divided 
the quartiles based on the transit time. However, the quartiles 
based on transit time might not be the same as those based on 
geography, because if some patients might have mixtures of very 
rapid transit through one part of the colon and then very slow 
transit through the next, although these two segments might differ 
pathophysiologically. Although those cases were not considered 
common findings, it may be a limitation of our study. 

Small intestinal bacterial overgrowth (SIBO) is a condition in 
which an excess of colonic bacteria colonize the small intestine. 
The symptoms of SIBO and IBS overlap (31), and it has been sug- 
gested that the prevalence of SIBO is increased in IBS or that SIBO 
may have an etiological role in the pathogenesis of IBS, particularly 
in view of the improvement of symptoms with antibiotic treatment 
(32-35). However, the link between SIBO and IBS is still unclear, 
and there are some controversies regarding the real prevalence and 
the importance of SIBO in the pathogenesis of IBS (36). Our study 
findings indicate an increased bacterial fermentation in the large 
but not in the small bowel of patients with IBS, thus supporting the 
hypothesis of altered microbial composition and/or function in the 
colon rather than in the small bowel (e.g., in SIBO) in the patho- 
genesis of IBS. In addition, our finding of significant differences 
in total and segmental CTT, but not in SBTT between IBS and 
its subtypes and HC, further supports the importance of altered 
colonic rather than small bowel physiology in the pathogenesis of 
the disorder. 

The importance of SCFAs as possible etiological factors in the 
pathogenesis of IBS is suggested by observations from animal 
and human studies. Animal studies demonstrate that SCFA can 
initiate high-amplitude propagated colonic contractions (37) and 
accelerated intestinal transit and motility via intestinal release of 
5-hydroxytryptamine (38). Intracolonic infusion of 0.5% acetic 
acid enhanced sensitivity to colorectal distension (39). In a human 
study, the total concentration of SCFAs in jejunal secretion was 
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approximately four times higher in patients with SIBO than in 
healthy subjects (40). In addition, a recent study reported higher 
counts of acetic- and propionic-acid producing bacteria ( Veillonella 
and Lactobacillus) in IBS patients and an association between the 
higher levels of these SCFAs and GI symptoms and QOL (41). 
However, the studies in this area have generated conflicting find- 
ings on fecal organic acids. For example, a study conducted in 
patients with D-IBS showed that fecal SCFAs were decreased com- 
pared with HC (42). In our study, there were no significant differ- 
ences in the mean levels of fecal SCFAs between IBS (all subtypes) 
and HC, and the levels of fecal SCFAs did not correlate with IBS 
symptom severity. However, we were able to demonstrate that fecal 
SCFAs levels in C-IBS patients were significantly lower than those 
in D-IBS patients and tended to be lower than those in all the other 
IBS subtypes and HC. In addition, as expected, SCFAs levels nega- 
tively correlated with CTT, which is consistent with our finding of 
a positive correlation between colonic pH and CTT, indicating that 
higher fermentation with higher production of SCFA is associated 
with increased motility and shorter transit times. An alternative 
explanation can be reduced absorption of SCFAs in patients with 
rapid intestinal transit due to a shorter transit time. The difficulties 
in interpretation of these findings reflect the multifactorial deter- 
minants of fecal SCFAs. Intraluminal SCFAs are very efficiently 
absorbed in the colon, and only 5 to 10% of the SCFAs produced 
by bacterial fermentation are excreted and can be measured in the 
stool (43-45). The correlation between fecal SCFAs levels and CTT 
and the differences between IBS subtypes suggest that fecal SCFAs 
levels may be controlled by CTT more dominantly than fermenta- 
tion degree; thus, fecal SCFAs may not be a sensitive marker to 
estimate intraluminal bacterial fermentation. 

In addition to magnitude and location, we examined the associ- 
ations between intestinal fermentation and clinical presentation of 
IBS. Unlike what we expected, the colonic pH levels positively cor- 
related with IBS symptom scores including IBS-SSS. This finding 
suggests that other factors such as abnormal intestinal motility and 
psychological disturbances may be more important than bacterial 
fermentation in determining symptoms severity. 

The wireless motility SmartPill capsule measures pH, pressure, 
and temperature in real-time throughout the GI tract (21). Overall, 
pH profiles in the GI tract are characterized by an abrupt rise in 
pH from acid to near neutral as the capsule exited the stomach, a 
slow continued rise in pH through the small bowel until reaching 
the large bowel where pH falls more than 1 unit, and subsequently 
there is a slow rise in pH through the colon (21,46). The pH drop at 
the ileocecal junction is well documented in studies using ingest- 
ible radiotelemetry capsules (47,48). Time between an abrupt rise 
of pH in duodenum and a fall of >1 pH unit in ileocolonic junc- 
tion was taken as SBT'T. CTT was assessed by measuring the time 
required from cecal entry to body exit (loss of signal and/or an 
abrupt temperature drop) (21). A recent scintigraphic study using 
a radioisotope-labeled WMC has validated this pH change at the 
ileocolonic junction and has shown that the fall in pH observed 
with WMC corresponds to the time of arrival of the WMC into 
the cecum or ascending colon (49). Several studies have confirmed 
the utility of WMC in quantifying CTT when compared with 
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the radiopaque marker method. There was a good agreement 
between the WMC and radiopaque marker transit results (22,46). 
The WMC is a validated test; however, there are some confound- 
ing issues with the test. The pH drop at the ileocecal junction is 
occasionally (<5%) not clearly identifiable, and device failure is 
reported in <3% of cases. In our study, the pH landmarks were well 
identified in all cases, and there were no capsule failures. 

To our knowledge, there has been no previous study to evaluate 
bacterial fermentation in the whole intestine by assessing intralu- 
minal pH changes in patients with IBS. However, our study may 
have some limitations. First, although subjects were asked to keep 
their regular diet during the study, we did not have a detailed 
dietary data and could not control for dietary effects on intestinal 
physiology and fermentation processes including intraluminal pH 
and SCFA production. Second, we did not investigate the intesti- 
nal bacterial community, so we do not know which bacterial spe- 
cies or genera were altered and how these alterations relate to the 
variables of interest. Third, we did not evaluate gas production, 
such as hydrogen and methane, which might be associated with 
symptom generation. Finally, the analyses of the wireless motility 
capsule (SmartPill) data included a relatively small number of sub- 
jects and involved some multiple comparisons, particularly in the 
sub-analysis by IBS subtypes. Further large-scale studies, includ- 
ing the analyses of dietary factors, gas production, and bacterial 
communities are needed. These studies may enable to associate 
alterations in bacterial fermentation with specific diet or bacterial 
communities and provide insights on how these associations relate 
to the altered physiology and symptoms observed in IBS. 

In conclusion, we found that colonic intraluminal pH is 
decreased, suggesting increased bacterial fermentation, in IBS 
patients across all subtypes compared with HC, and that this pro- 
cess is more prominent in the proximal colon. The altered intesti- 
nal fermentation and transit in IBS appear to be in the colon and 
not in the small bowel. Higher levels of fecal SCFAs and lower 
levels of intraluminal pH are associated with decreased CTT. 
The hypothesized model of our study results was illustrated in 
the Figure 3. Our study further supports the notion that altered 
colonic bacterial fermentation has an important etiological role 
for in the pathogenesis of IBS. Future studies should investigate 
the degree to which altered intestinal fermentation is a cause of 
IBS symptoms or an effect of IBS-related altered physiology by, if 
possible, direct measurement of bacterial fermentation and clarify 
the possible clinical benefit of targeting intestinal fermentation in 
the treatment of functional GI disorders. 
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Study Highlights 


WHAT IS CURRENT KNOWLEDGE 


Alterations in the gut microbiota have a role in irritable 
bowel syndrome (IBS). 


Alterations in intestinal fermentation lead to certain 
physiological abnormalities such as intraluminal excessive 
gas production and altered motility. 


Diet low in fermentation substrates such as poorly absorb- 
able, highly fermentable carbohydrates offers considerable 
symptom relief in IBS. 


The understanding of the role of altered intestinal fermen- 
tation, including its magnitude and location, in the 
pathogenesis of IBS is not completely understood. 


WHAT IS NEW HERE 


Colonic intraluminal pH decreased, indicating higher 
intestinal fermentation, in IBS patients across all subtypes 
compared with healthy controls. 


The intestinal fermentation in IBS ts altered in the colon 
but not in the small bowel. 


Higher levels of fecal SCFAs and lower levels of intraluminal 
pH are associated with decreased colon transit time. 
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Neuronal correlates of placebo 


In chronic FGIDs 


QiQi Zhou and G. Nicholas Verne 


Placebo analgesia is increasingly appreciated in many difficult to treat 
chronic functional gastrointestinal disorders such as IBS. However, 
investigations of interactions between psychological and biological 
placebo factors are still in early stages. Now, technologies have been 
developed that enable neural mechanisms of placebo analgesia to be 


studied more directly in humans. 
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IBS is one of the most common functional 
gastrointestinal disorders (FGIDs) in which 
the underlying pathophysiology of the dis- 
order remains poorly understood. Some 
patients with IBS have heighted percep- 
tion to both somatic and visceral stimuli, 
suggesting abnormalities in central and/or 
peripheral pain processing (a theory sup- 
ported by neuroimaging studies that reveal 
altered brain activation in response to 
nociceptive stimuli).'* This ongoing sensi- 
tization leads to altered input to the central 
neuroaxis, which might induce long-term 
microstructural reorganization of the brain 
and subsequent failure of conventional 
therapies and a robust placebo response in 
patients with IBS.° A new study by Schmid 
et al.° sought to further understand the 
neural circuitry of placebo analgesia. 
Schmid and colleagues® compared 
nociceptive rectal distension in patients 
with IBS, patients with ulcerative colitis 
and healthy individuals as controls after 
administration of a saline placebo with 
either instructions of pain relief (placebo; 
patients were told they were receiving a 
highly potent analgesic drug when in fact 
they were administered saline) or neutral 
instructions (control; patients were truth- 
fully told that saline was administered). 
Neural activation as assessed by functional 
MRI (fMRI), which was evaluated along 
with ratings of perceived and expected pain, 
as well as measures of salivary cortisol con- 
centrations and negative affectivity. Patients 
with IBS did not exhibit neural down- 
regulation of rectal-distension-induced pain 


during placebo analgesia, whereas controls 
and patients with ulcerative colitis both 
demonstrated marked reductions in neuro- 
nal activity in the cingulate and somato- 
sensory cortex. Interestingly, patients with 
IBS displayed an enhanced activation of the 
somatosensory cortex in the placebo condi- 
tion. Psychological testing further revealed 
that depression scores correlated with a 
weaker placebo analgesia. 
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Much progress has been made over the 
past century in characterizing the neural 
circuitry that controls inhibition of pain. 
Ascending and descending pain pathways 
terminate within cortical and subcorti- 
cal structures that are the basis for pain- 
modulatory pathways such as the amygdala, 
thalamus, hypothalamic nuclei, rostroven- 
tral medulla and midbrain periaqueductal 
grey (Figure 1). These structures are central 
targets of transmission and modulation of 
nociceptive pain pathways. Intact descend- 
ing inhibitory modulation of nociceptive 
afferent impulses is paramount to enable 
adequate pain control in many chronic pain 
disorders such as IBS. Several recent techno- 
logies and experimental approaches have 
been developed whereby neural mechanisms 
of analgesia can be studied more directly in 
human study participants. These approaches 
include functional imaging methods such 
as PET or fMRI. One or several of these 
measurements can be combined with valid 
measures of expectancy and desire for pain 
relief, as well as with reliable measurement 
of pain intensity and pain unpleasantness. 
Such an approach makes it possible to 


Hypothalamus 


Figure 1 | Pain-modulatory pathways. This diagram depicts ascending (left) and descending 
(right) pain pathways (red lines) that terminate within cortical and subcortical structures and are 
the basis for pain-modulatory pathways and include the amygdala, hypothalamic nuclei or 
hypothalamus, thalamus, midbrain PAG, and the RVM. These cortical and subcortical structures 
are central targets of transmission and modulation of nociceptive pain pathways. Abbreviations: 
PAG, periaqueductal grey; RVM, rostroventral medulla. 
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simultaneously examine how the placebo 
analgesic effect is expressed at different 
levels. The potential for analysis of relation- 
ships between cognitive mediating factors 
and cerebral pain-modulating circuitry 
in the case of placebo analgesia was first 
illuminated by two early studies. 

The first study used neuroimaging of 
displacement of a radioactive opioid ligand 
({"C] carfentanil) to examine the function 
of the opioid system and u-opioid receptors of 
the brains of healthy humans undergoing 
sustained pain.’ Sustained pain induced 
regional release of endogenous opioids in 
brain regions known to modulate pain: the 
rostral anterior cingulate cortex; ipsilat- 
eral amygdala; and contralateral thalamus. 
Activation of the p-opioid receptor system at 
all of these sites was associated with reduc- 
tions in the sensory and affective ratings of 
pain. A second study used PET to demon- 
strate that both opioid and placebo analge- 
sia are associated with increased activity 
in the rostral anterior cingulate cortex in 
healthy humans.* They also observed a 
covariation between activity in the rostral 
anterior cingulate cortex and the lower 
brainstem during both exogenous opioid 
and placebo analgesia, but not during pain 
alone. Both investigations suggest the feasi- 
bility of relating psychological mediating 
factors of placebo analgesia to brain regions 
involved in pain modulation. 


GG Some patients with IBS have 
heighted perception to both 
somatic and visceral stimuli... ¥F 


The possibility of a refined analysis of 
placebo effects within studies of patients 
with FGIDs has far-reaching scientific and 
medical implications.’ Our present limited 
capacity to ascertain, measure and control 
for placebo effects is at the heart of complex 
and difficult questions about pharmaco- 
logical therapies for pain as well as many 
nonpharmacological therapies, particularly 
those related to surgery, hypnosis, electrical 
stimulation and alternative complemen- 
tary and medical treatments. One potential 
improvement for investigations into the 


placebo effect would be to include a clini- 
cally relevant type of stimulation that would 
be more likely to activate at least some of 
the mechanisms believed to be important in 
placebo analgesia than would typical experi- 
mental pain stimuli. We need to know more 
about how mediating factors determine the 
presence and magnitude of placebo analge- 
sia, as well as about their psychological 
and neural mechanisms. Advances in the 
measurement of placebo analgesia will not 
only contribute to a more refined under- 
standing of the mechanisms in placebo 
analgesia, but is also likely to contribute to 
a more precise determination of the magni- 
tude of placebo analgesia. Indeed, the inabil- 
ity of patients with IBS to effectively engage 
neural downregulation of visceral pain 
signals might help to illuminate the high rate 
of failure of many pharmacological therapies 
for functional bowel pain.®’ Strategies can 
be developed whereby it is possible to assess 
most or all of the relevant mediating factors 
that contribute to placebo effects. 

The latest neuroimaging studies raise 
several important considerations regarding 
the understanding of the neural mecha- 
nisms of the placebo effect in patients with 
FGIDs.®"° Interestingly, patients with IBS in 
the study by Schmid et al.° did not exhibit 
visceral hyperalgesia. A subset of patients 
with IBS (30-40%) exhibit visceral hyper- 
sensitivity to colonic distension,’ and it is this 
group of patients that would be expected not 
to adequately engage neural downregulation 
of visceral pain. Future studies are needed to 
compare neural placebo responses between 
patients with IBS with and without visceral 
hypersensitivity. In addition, other pain 
transmission pathways need to be assessed 
during placebo conditioning in patients 
with IBS, including the hypothalamic nuclei, 
rostroventral medulla and the midbrain 
periaqueductal grey. Also, a more detailed 
analysis of the patients’ desire and expec- 
tancy for pain relief during placebo analgesia 
might provide additional important infor- 
mation along with imaging in patients with 
IBS before and after cognitive-behavioural 
therapy that might augment descending 
inhibitory pain pathways. Finally, future 
imaging studies are needed to evaluate 


neuronal deactivation during placebo analge- 
sia, which could delineate additional cerebral 
patterns important in the placebo response. 
Thus, it is possible that, in the future, pla- 
cebo analgesia could be used to augment 
established medical therapies for FGIDs. 
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Tipping elements in the human intestinal 
ecosystem 
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The microbial communities living in the human intestine can have profound impact on our 
well-being and health. However, we have limited understanding of the mechanisms that 
control this complex ecosystem. Here, based on a deep phylogenetic analysis of the intestinal 
microbiota in a thousand western adults, we identify groups of bacteria that exhibit robust 
bistable abundance distributions. These bacteria are either abundant or nearly absent in most 
individuals, and exhibit decreased temporal stability at the intermediate abundance range. 
The abundances of these bimodally distributed bacteria vary independently, and their 
abundance distributions are not affected by short-term dietary interventions. However, their 
contrasting alternative states are associated with host factors such as ageing and overweight. 
We propose that the bistable groups reflect tipping elements of the intestinal microbiota, 
whose critical transitions may have profound health implications and diagnostic potential. 
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our intestinal tract can have profound impact on well- 

being and health!~*. Although the compositional and 
functional properties of the intestinal microbiota have been 
studied extensively, we have limited understanding of the 
mechanisms that control this complex ecosystem?®. A 
fundamental question is whether the overwhelming variability 
of the microbiota composition occurs along smooth phylogenetic 
gradients or through abrupt switches between alternative stable 
states’. In ecology, the concept of alternative stable states 
separated by tipping points has emerged as a theoretical 
framework with major practical implications’. 

Here we demonstrate the same properties in the human 
intestinal ecosystem. While previous studies have focused on 
analysing ecosystem-wide configurations in human intestinal 
microbiota””!”, our analysis of a thousand western adults shows 
that there are distinct groups of bimodally distributed bacteria 
that tend to be either very abundant or nearly absent, and exhibit 
reduced stability at the intermediate abundance range between 
these two contrasting states. We propose that the bistable groups 
we identified represent ‘tipping elements’!!, specific components 
of the intestinal microbiota that exhibit alternative stable states 
linked to the overall ecosystem state and our physiology. While 
further studies are needed to establish the possible causal 
relations, the bistable taxa can serve as indicators of the 
community state and its potential health associations, and 
resetting these ‘bacterial switches’ may be a radically new way 
to approach the rapidly growing number of health issues related 
to the intestinal microbiota, changing the way we look at 
management of the intestinal ecosystem. 


Ty: versatile and dynamic microbial ecosystems inhabiting 


Results 

Identification of alternative states. To systematically investigate 
the hypothesis of alternative stable states in the intestinal 
microbiota, we analysed 130 genus-like phylogenetic groups that 
cover the majority of the known bacterial diversity of the human 
intestine!? (Supplementary Table 1). Our analysis is based on 
> 2,000 standardized phylogenetic microarray hybridizations that 
represent the faecal microbiota of a thousand adults (n = 1,006; 
18-77 years) from 15 western countries (Europe and the United 
States; see Methods). On the basis of this extensive data 
collection, we identified several bacterial groups exhibiting 
robust bimodal abundance distributions (Fig. 1; Supplementary 
Table 2), including Dialister spp., relatives of Bacteroides fragilis, 
Prevotella melaninogenica, P. oralis and two groups of the 
uncultured Clostridiales (UCI and UCH; Supplementary Note 1). 
The strongly correlated P. melaninogenica and P. oralis were 
combined into a single Prevotella group (Supplementary Note 2). 
The other 124 genus-like groups that we investigated were 
either rare or exhibited symmetric or skewed abundance types 
(Fig. 2; Supplementary Table 1). 

To detect alternative states in the intestinal bacteria, we used 
potential analysis that was recently applied to study analogous 
state shifts in climatology and forest ecosystems!*!4. This 
theoretical framework for state identification in stochastic 
dynamical systems is well suited for studying the remarkably 
variable intestinal microbiota. We used bootstrap analysis to 
ensure that the findings were robust to random sampling 
variations, and selected for further analysis the taxonomic 
groups that showed consistent evidence for multimodality with 
a bootstrap support of >98% for Dialister spp., P. oralis, 
P. melaninogenica, UCI and UCII groups, and 93% for the B. 
fragilis group. The evidence was the strongest for two alternative 
states in all these groups, and further supported by an alternative 
approach, the prediction strength index (PSI)!°!°, which 


provided strong)? evidence of bimodality (PSI>0.90) for 
Dialister spp., UCI, UCI and relatives of P. melaninogenica 
and P. oralis, and a moderate support for the B. fragilis group 
(PSI > 0.80). 

To control the possibility that the bimodal distributions could 
be explained by diet or other independent environmental factors 
due to potentially unbalanced sampling, we confirmed the 
bimodality in distinct subsets of the data collection, controlled 
potential cross-hybridization effects between closely related taxa 
(Supplementary Note 2) and investigated the available host and 
environmental parameters (see Methods). In all cases, except the 
B. fragilis group, the bimodality was supported in independent 
sample sets analysed with different DNA extraction methods, 
including two sets of samples extracted with different combina- 
tion of mechanical and chemical lysis steps (N=401 and 
N= 287). Moreover, the 287 samples were all from central 
Europe, and thus expected to be less heterogeneous regarding the 
long-term dietary patterns compared with the entire data 
collection that included subjects from the United States and 
from 14 countries in Europe. Since long-term habitual diets are 
correlated with geographic regions, the verification of bimodality 
in two different data sets with different geographical distributions 
supports our conclusions by reducing the possibility that the 
observed bimodality is driven by country-specific dietary or other 
life-style differences within the study cohort. Although all DNA 
preparations were identically processed for the microarray 
analysis, the verification in independent subsets of the data 
collection further helps to exclude the possibility that the 
detected bimodality derives from technical variation in sample 
processing. 

Analysis of an additional data set of 138 individuals from five 
dietary intervention trials indicated that the observed bimodality 
is robust against short-term dietary interventions as we did not 
observe significant population-level differences in the mean 
abundances between the baseline and post-intervention samples 
in the bimodal taxa in these studies (P>0.05; t-test; Benjamini- 
Hochberg correction; see Methods). Remarkably, the bimodal 
patterns were also indicated among the 46 placebo-treated control 
subjects in these controlled dietary intervention _ trials 
(Supplementary Fig. 1), further supporting the conclusion that 
the population-level bimodal patterns and the observed state 
shifts cannot be explained by short-term dietary differences. 
Whereas the observed states were robust to short-term dietary 
interventions, this does not exclude the possibility that targeted 
dietary interventions could be potentially used to manipulate 
specific bistable groups. On the basis of the available data 
collected in intervention trials, a pronounced dietary effect on the 
bimodal taxa was observed, for instance, on a formula-based 
weight-loss diet during which 29% (9/31) of the subjects exhibited 
a switch from the high to the low-abundance state of Dialister 
spp. These profound individual changes were, however, masked 
by remarkable within-state variation among the other subjects so 
that the average group-level fold change following the interven- 
tion was moderate and only borderline significant (log;9>FC = 
— 0.39; P=0.08; paired Wilcoxon test). Although the observed 
states appear predominantly robust to dietary variation, we and 
others have shown that the abundance of the implicated bacteria 
can be to some extent affected by specific interventions as the 
quantity and quality of consumed carbohydrates can alter the 
abundance of Bacteroides and Prevotella species!®-!® as well as 
that of Dialister spp.!? However, the reported changes were 
relatively small (average |log;j>FC|<0.5) in the bimodal groups 
that we now report’”’’ and thus induced mainly within-state 
variation according to the present analysis. Hence, further studies 
are needed to assess the possibility to induce state switches in 
bistable taxa based on dietary or other interventions. 
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Figure 1 | The bimodal bacteria. Logarithmic abundance distributions of the six bimodal phylogenetic groups that exhibit robust alternative states of low 
(blue) and high (red) abundance across intestinal microbiota of 1,006 western adults. The UCI and UCII refer to the uncultured Clostridiales | and Il, 
respectively. The frequency of the observations is shown as a function of the phylogenetic microarray logig signal. 
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Figure 2 | Bacterial abundance types. Bacterial abundance types include 
symmetric, skewed and bimodal logarithmic abundance distributions 
(Supplementary Table 1). The skewed types include prevalent left- and 
right-skewed types as well as rare bacteria. The bimodal types (Fig. 1) 
include cases with two distinct peaks, and cases with a peak at low 
abundance combined with a more widely varying fat tail of high-abundance 
subjects. The population frequencies of the log abundance across the 1,006 
western adults are shown for representative examples from each category: 
Anaerostipes caccae et rel. (symmetric); Serratia spp. (rare); Streptococcus 
bovis et rel. (right skewed); Faecalibacterium prausnitzii et rel. (left skewed); 
P. oralis et rel. (bimodal distribution with two distinct peaks); and uncultured 
Clostridiales | (fat tailed, bimodal distribution with a variable high- 
abundance state). 


Temporal dynamics. The theory of stochastic dynamical systems 
predicts® that alternative stable states are separated by an 
intermediate unstable region, called a tipping point, where even 
small fluctuations may lead to an abrupt, self-propagating shift to 
an alternative state. While this can give rise to bimodal 
abundance distributions®, as in our reported bimodal bacteria 
that tend to be either rare or abundant in most subjects, the 
observed bimodality could also be caused by other factors such as 
diet or yet unknown host factors, dividing the study cohort into 


two distinct subpopulations. Thus, bimodality of states provides 
only indirect evidence for bistability. In bistable systems, 
observations in the intermediate region between the alternative 
states may correspond to subjects in a transitional phase between 
the states®, for instance, due to changes in diet or physiological 
state. We therefore looked into the temporal dynamics of the 
states based on a subset of 78 subjects (27-63 years) in our main 
cohort with additional follow-up samples within 1-9 months 
from the baseline (see Methods). This complements our cross- 
sectional analysis that represents a static snapshot of the intestinal 
ecosystem in each individual. A closer look at the temporal 
fluctuations (Fig. 3) indicated that subjects closer to the tipping 
point indeed had larger temporal deviations, suggesting that the 
intermediate abundances appear less attractive than the 
contrasting alternative states. We confirmed this by quantifying 
the stability of bacterial abundances at the intermediate 
abundance range. The observed deviations from the baseline in 
the later time points were higher in subjects whose baseline 
abundances were closer to the intermediate (50%) abundance 
range, as indicated by significant negative correlation (Spearman’s 
coefficient p) between these variables for the Dialister spp. 
(— 0.54), P. melaninogenica (—0.40), P. oralis (—0.34), UCI 
(— 0.37), UCII (—0.53) and B. fragilis (— 0.19) groups. These 
correlations were significant (P<0.01; upper tail (AS89) test; 
Benjamini—Hochberg adjustment) for all bimodal groups, except 
B. fragilis (P = 0.1). Moreover, comparison between the altogether 
60 prevalent taxa showed that the bimodal groups had 
significantly reduced intermediate stability compared with the 
other groups (Fig. 4; P<0.002; hyper-geometric test). This 
observation supports the hypothesis of two alternative stable 
states of low and high abundance divided by an unstable tipping 
point at the intermediate abundance range in these taxa, in line 
with -_ theoretical predictions outlined above for bistable 
systems®. Furthermore, the comparison between bimodality and 
intermediate stability in Fig. 4 suggested the presence of further 
bistable bacteria (Firmicutes), including Clostridium spp. (sensu 
stricto) and relatives of C. colinum, Lactobacillus plantarum and 
uncultured Mollicutes, as well as the C. difficile group that 
exhibits exceptionally low intermediate stability and a moderate 
support for bimodality, in line with earlier observations that a 
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Figure 3 | Temporal variation. Temporal variation in each bimodal group during the follow-up interval (1-9 months from the baseline for 95% of the 

subjects, see Methods). Each horizontal line indicates the abundance range of the indicated bacteria for a single subject. The 78 subjects (horizontal lines) 
are ordered based on their mean abundance. The red lines highlight subjects exhibiting a state shift (Supplementary Fig. 3) across the estimated tipping 
point (dashed vertical lines). The differences in the follow-up times did not significantly affect the observed fluctuations (P>0.05; Pearson correlation; 


Benjamini-Hochberg adjustment). 


small fraction of healthy adults carry C. difficile?®, one 
of the members of this group. Toxin-producing derivatives of 
C. difficile have been identified as the cause of severe and 
recurrent antibiotic-associated diarrhoea, and recent faecal 
transplantations have been shown to provide efficient means to 
manipulate its alternative states and treat the disease, resulting in 
complete recovery”). It is currently not known whether the 
alternative states of C. difficile as part of the normal microbiota 
are associated with the infection risk. While these observations 
suggest bistable properties for these taxa, we could not confirm 
their alternative states in the other tests. Furthermore, we 
analysed the temporal stability of the above-mentioned states, 
quantified as the fraction of subjects remaining in the initial state 
after a 3-month interval as estimated by Kaplan-Meier survival 
analysis (see Methods). The states with higher population 
frequencies were more stable in general (Supplementary 
Fig. 2a), in line with predictions based on the stationary-state 
continuous Markov process (Supplementary Fig. 3) and 
consistent with the idea that changes in species abundance 
distributions can be used to infer changes in the stability and 
resilience of alternative states’>. The high-abundance Prevotella 
group was a notable exception as one of the least frequent but yet 
one of the most stable states. The alternative states of the reported 
bistable groups were also more stable than expected based on 
empirical distributions derived from the unimodal taxa 
(Supplementary Fig. 2b); the Prevotella group exhibited less 
state mixing than any other group (100%), followed by UCI (less 
mixing than in 93% of the comparison groups), Dialister spp. 
(83%) and UCII (78%). The B. fragilis group was again an 
exception (35%). 


Analysis of individual phylotypes and methanogenic archaea. 
Phylotype-level analysis of the six bimodal groups indicated that 
the observed bistability is often associated with particular phy- 
lotypes, and absent in others (Supplementary Figs 4 and 5; 
Supplementary Note 3). This could indicate potential antagonism 
or metabolic specialization between the implicated phylotypes. In 


certain groups, such as the UCI and UCI, the most prevalent 
phylotypes exhibited stronger bimodality than their higher-level 
genus-like group (Supplementary Fig. 4). Of the bistable groups 
indicated in Fig. 4, only the Dialister spp., relatives of P. mela- 
ninogenica and P. oralis, and UCI/UCII groups, whose bistability 
was systematically supported in the different analyses, had pre- 
valent phylotypes according to our criteria. Moreover, we 
observed notable variation within the genus Prevotella as P. 
ruminicola exhibited only moderate correlation with the com- 
bined P. oralis/P. melaninogenica group (r=0.39). The most 
strikingly bimodal bacteria were uncultured phylotypes, as in the 
case of the P. melaninogenica and P. oralis groups, where the type 
species were detected only at low abundance, but the related 
uncultured bacteria exhibited bimodal distributions (Supple- 
mentary Fig. 4). D. invisus (Dialister spp.) is an example of a 
cultured species with a strong bimodal abundance distribution 
(Supplementary Fig. 5). 

We augmented the phylotype analysis with quantification of 
methanogenic archaea as this group is not targeted by the Human 
Intestinal Tract Chip (HITChip) phylogenetic microarray. The 
analysis of alternative states in methanogenic archaea is of 
interest as the current evidence is inconclusive regarding their 
prevalence in human populations. Traditionally, only a fraction of 
individuals have been considered as carriers, but a recent 
molecular study reported that the dominant archaeon, Methano- 
brevibacter smithii, is present in >95% of individuals**. We 
observed indications of bimodality in Methanobrevibacter spp. 
based on the quantitative PCR-based’? enumeration, which was 
available for 53 subjects from our main cohort. On the basis of 
this analysis, we found 47% of these unique subjects to be carriers 
of Methanobrevibacter. Moreover, the logarithmic abundance 
distribution (Fig. 5) exhibited clear bimodality, representing 
carriers and non-carriers. Quantitative PCR-based data for these 
archaea were available for 452 additional samples from 189 
unique subjects, of which 57% were found to be carriers. Among 
the 53 subjects in common with our main cohort, the abundance 
of these archaea had moderate positive correlation with the 
Prevotella group (r=0.32; P=0.02; t-statistics), UCI (r=0.28; 
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Figure 4 | Bimodality is associated with reduced intermediate stability. 
Analysis of the 60 prevalent taxa indicates a group of bistable bacteria with 
robust bimodal abundance distributions (> 90% bootstrap support; vertical 
axis) and reduced intermediate stability (o < — 0.3; horizontal axis), as also 
indicated by the corresponding frequency histograms (side panels). The 
bimodality scores are shown for the 401 samples extracted by mechanical 
lysis2°; the stability estimates are based on the 78 follow-up subjects. The 
black symbols indicate the bistable groups whose bimodality was supported 
in multiple data sets, including the Dialister spp. (square), P. melaninogenica 
(large diamond), P. oralis (small diamond), UCI (small circle) and UCII 
(large circle) groups. These show also significantly reduced intermediate 
stability compared with the other prevalent groups (9 < — 0.3; P<0.002; 
hyper-geometric test). The other bistable taxa (white circles) whose 
bimodality we could not verify in the other data sets include Firmicutes 
(Clostridium spp. (sensu stricto), relatives of C. colinum, Lactobacillus 
plantarum and uncultured Mollicutes). The bimodal B. fragilis group (grey 
upward triangle) has only moderately reduced intermediate stability 

(p = — 0.19). The C. difficile group (grey downward triangle) has the third 
lowest intermediate stability. 


P=0.04) and UCI (r=0.23; P=0.1) groups that we also found 
to be associated with increased community diversity and 
metagenomic richness (Supplementary Fig. 6; Supplementary 
Note 4). 


Tipping elements and the ecosystem state. While multimodality 
in specific microbial groups, notably the Prevotella genus, has 
previously been reported!®, our analysis complements and 
extends these earlier studies by providing increased 
phylogenetic depth and sample size, and by suggesting 
previously unreported taxonomic groups exhibiting bistable 
properties that associate to the overall intestinal ecosystem state 
and host health. The six bimodal genus-like groups that we 
identified exhibited only weak preferences for mutual co- 
occurrence or exclusion (|p|<0.16 among the 401 samples with 
mechanical lysis; Fig. 6), except a positive correlation between 
UCI and UCII (9 =0.50) and between P. melaninogenica and 
P. oralis (p =0.97), which we have combined into a single 
Prevotella group. The low- and high-abundance states of the 
resulting five bimodal groups yield altogether 32 (2°) possible 
state combinations with varying population frequencies (Fig. 6), 
suggesting that the alternative stable states associated with 
distinct subcommunities often coexist in a host in different 
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Figure 5 | Abundance distribution of Methanobrevibacter, the dominant 
methanogenic archaeon in the human intestine. Quantitative PCR-based 
(gPCR)° logarithmic abundance distribution, expressed as 16S rRNA 
gene copies per gram of faeces, indicates bimodality of this genus. 
Methanobrevibacterium spp. were detected in 47% of the 53 unique subjects 
from our main cohort for which the qPCR data were available. For subjects 
where Methanobrevibacterium spp. were not detected, the abundance is set 
at the detection limit of the assay. The colours indicate the two groups of 
subjects where the Methanobrevibacter spp. are absent or below the 
detection limit (blue), or detected (red). 


combinations. We observed, for instance, a substantial overlap 
between the B. fragilis and Prevotella groups, with no significant 
differences in B. fragilis abundance between the low- and high- 
abundance Prevotella subjects (logigFC<0.05; t-test; P>0.05). 
Similar results on the genus Bacteroides have been reported 
earlier”®, in contrast with other studies that have reported a 
notable trade-off between Bacteroides and Prevotella spp®??’. 
The bistable groups also covaried with other bacteria (Fig. 7), 
suggesting that their state shifts can be associated with wider 
changes in the overall ecosystem composition, and could be 
potentially used as indicators of the overall community state. 
Establishing possible causal relations between the bistable groups 
and ecosystem composition will require further studies, however. 

Ecosystem-level investigation based on principal component 
analysis (PCA; Supplementary Fig. 7) and the PSI!9!> indicated 
two alternative states in the overall microbiota composition, 
roughly corresponding to low and high Prevotella, in line with 
previous reports”!°, The main principal component explained 
16% of the overall variation in the data and exhibited a 
continuous gradient associated with Bacteroides spp. and 
relatives of Oscillospira guillermondii and Faecalibacterium 
prausnitzii. The second principal component exhibited a 
bimodal distribution associated with the Prevotella (relatives of 
P. oralis and P. melaninogenica) group, explaining 13% of the 
total variation. We did not observe robust bimodality on the other 
major principal components. A complementary analysis using the 
principal coordinates analysis with UniFrac distances”® yielded 
similar results, although the bimodality of the second (Prevotella) 
principal axis was less pronounced than in the standard PCA 
(Supplementary Fig. 7). Whereas the genus Prevotella and 
Bacteroides spp. have been earlier suggested as drivers of 
community-level enterotypes’, our observations extend and 
complement this analysis by identifying specific bacteria 
exhibiting bistable properties within these groups and analysing 
their temporal dynamics, and by demonstrating that these groups 
in fact frequently co-occur in a host in varying combinations. The 
high-abundance state of the Prevotella group had the most 
dominating effect on the ecosystem, with an average relative 
abundance of 10% among the subjects with the high-abundance 
state (8 and 2% for the co-occurring P. melaninogenica and 
P. oralis groups, respectively). Due to their lower relative 
abundances (<2% in the high-abundance state) compared with 
the Prevotella group, the bimodal patterns in the B. fragilis, 
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Figure 6 | The tipping elements co-occur in various combinations. The bimodal groups (rows) co-occur in various combinations within the study 

population. Shading indicates the relative abundance (logj9) for each group with respect to the identified tipping point between the alternative states of low 
(blue) and high (red) abundance. The 1,006 subjects are ordered based on the combination frequencies (top row). The most frequent combination (18%) 
corresponds to the high-abundance state of the UCI, UCII and B. fragilis groups combined with the low-abundance state of the Dialister spp. and Prevotella 
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Figure 7 | Covariation between the bimodal groups and other taxa. 
Summary of correlations between the bimodal groups (columns) and the 
124 other genus-like groups targeted by the HITChip microarray; significant 
correlations with 25 groups (rows; P<0.05; |r| >0.25; Pearson correlation) 
were consistently observed in independent sample sets analysed using two 
different DNA extraction methods (mechanical lysis; n= 401 and 
enzymatic lysis-based method; n= 287), and in 100 bootstrap data sets 
sampled from the overall data collection (n=1,006). The remaining 99 
groups that did not show significant correlations have been removed for 
clarity. The 32 correlations that were found to be consistently significant in 
all tests are indicated by ‘+’. Shading indicates Pearson correlation 
between the taxa across the 1,006 western adults. 


Dialister spp., UCI and UCI groups are more easily masked by 
variability in other, more smoothly varying taxa that correlate 
with these groups (Fig. 7), and hence the alternative states 
associated with these bacteria are easily overlooked in ecosystem- 
level analysis. Interestingly, whereas these less abundant groups 
had more moderate influence at the ecosystem-level variation, 
their high-abundance states were notably more prevalent 
compared with the high-abundance state of the more abundant 
Prevotella group, with the following population frequencies: 
Prevotella group (20%), Dialister spp. (28%), UCI (53%), UCII 
(57%) and B. fragilis (75%). 

We also observed significant associations between the bistable 
bacteria, community diversity and the overall metagenomic gene 
richness (Wilcoxon test; P<0.01 for all groups, except B. fragilis; 
Supplementary Fig. 6; Supplementary Note 4) when we 
incorporated the recently reported metagenomic sequencing data 
that were available for 255 subjects”? in our main cohort. In our 
analysis of these meta-genomes, the high-abundance state of the 
B. fragilis and Dialister spp. groups were found to be associated 
with a low gene count, in line with their reported association with 
pronounced metabolic dysfunction??. On the contrary, we 
observed associations between the high-abundance state of the 
Prevotella, UCI and UCII groups and the high gene count, wa 
has been linked to a potentially healthy metabolic phenotype’. 
The increased metagenomic richness could be explained by the 
increased community diversity associated with these taxa; among 
the bistable groups, the UCI and UCI showed the strongest 
association with the overall community diversity (9 =0.5; 
P<0.001; upper tail (AS89) test). Whereas the community 
diversity exhibited also mild positive correlation with the 
Prevotella group (9 =0.13; P=0.04), the overall community 
diversity was higher in the subjects with high-abundance 
UCI/UCII than in the subjects associated with high-abundance 
Prevotella (P<0.001; t-test). The higher diversity associated with 
the UCI/UCI groups (<2% relative abundance) could be 
partially explained by the more dominating status of the 
high-abundance Prevotella group (10% relative abundance), 
which tends to reduce the overall community diversity. We 
did not observe significant associations between the B. fragilis 
and Dialister spp. groups and community — diversity 
(|p| <0.07; P>0.3). 


Host factors and health status. The emerging picture is that 
rather than overarching alternative states corresponding to dis- 
tinct community types, the natural variation of the human 
intestinal microbiota is reflected in specific bacterial groups 
representing ‘tipping elements’ that are only loosely coupled, 
specific components of the intestinal microbiota that exhibit 
alternative stable states linked to the overall microbiota compo- 
sition and host factors. We have borrowed this analogy from the 
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climate system that is now also believed to have distinct tipping 
elements'!. An important corollary is that the resilience (the 
capacity to recover from perturbations) of the alternative states 
will be influenced by a range of factors that may primarily affect 
specific subcommunities within the intestinal microbiota, rather 
than the overall ecosystem. Gradual changes in external or 
internal factors associated with the ecosystem changes may drive 
the implicated bistable bacteria towards a tipping point where an 
abrupt shift between the contrasting alternative states may 
follow®*°, Our analysis indicates that such shifts are often 
associated with changes in a wider set of covarying taxa (Fig. 7). 
One way to infer how resilience of the microbiota might 
depend on such factors is to use probability densities of 
bacterial abundance to estimate the basins of attraction. 
On the basis of this, we can observe, for instance, that a 
basin of attraction around an apparently resilient high-abundance 
state of UCI arises in late middle age (Fig. 8; Supplementary 
Table 3). This statistically significant association between 
UCI and age (P<0.05; y* test) suggests that the resilience 
of the low-abundance state decreases with increasing age, 
and eventually, even a small perturbation may induce a 
shift to the high abundance state. Moreover, the analysis shows 
that population-level variation may mask alternative states that 
are more pronounced in specific subpopulations, such as 
particular age groups (Fig. 8b). According to a logistic 
regression model, each age decade increased the odds of the 
UCI high-abundance state by 2.641.2% (95% confidence 
intervals), but the changes were particularly emphasized in 
the young and elderly populations. These findings are in line 
with previous studies that have reported notable changes in the 
overall microbiota composition in the elderly population?”*!~?, 
and with a recent study where differences at the community 


Elderly population 


level were observed between young and elderly adults, but less 
during the middle age*!. 

Overall, we observed a range of statistically significant 
associations between the bimodal taxa and host factors such as 
ageing, obesity and health status based on logistic regression 
(P<0.05; y7 test; see Methods). Among the 1,006 western adults, 
body mass index (BMI) was negatively associated with UCI and 
UCIL Age was positively associated with the UCI and UCII, and 
negatively with the Dialister spp. and Prevotella groups. Males 
had higher levels of bacteria belonging to the Prevotella group and 
lower levels of the B. fragilis group compared with women. 
Regarding nationality, the Prevotella group was underrepresented 
in Nordic and Southern European countries and UCII in South 
Europe, while the Dialister spp. and B. fragilis groups were 
particularly common in Northern Europe (Scandinavia, UK and 
Ireland). Analysis of an additional phylogenetic microarray data 
set derived from subjects with irritable bowel syndrome and 
severe obesity indicated association between these health 
complications and the low-abundance UCI and UCII (see 
Methods). Metabolic syndrome (MetS) was positively associated 
with B. fragilis (false discovery rate (FDR)<5%), and showed a 
trend (FDR<20%) towards positive association with Dialister 
spp. and a negative association with the Prevotella group 
(Supplementary Table 4). We did not observe associations 
between the bimodal groups and type I diabetes, ulcerative 
colitis or cardiovascular disease in this data collection. Overall, 
these results suggest that differences in health status can be 
reflected in changing state probabilities of specific taxa. 

In conclusion, these observations provide substantial evidence 
for bistability in specific taxonomic groups within the human 
intestinal microbiota, including Dialister spp., the Prevotella 
group and the uncultured Clostridiales groups UCI and UCII. 
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Figure 8 | Changing state probabilities associated with ageing. (a) A schematic illustration of shifting state probabilities in the UCI group associated with 
ageing. The UCI abundance data (bottom plane) suggest a catastrophe-fold, where the solid and dashed parts of the curve correspond to stable states and 
unstable equilibria, respectively. The depth and width of the potential minima indicate decreasing resilience of the system towards the bifurcation points (*) 
during ageing. (b) Observation density in various age groups highlights the associations between the UCI state probabilities and age (sample sizes: 220 
(18-30 years); 147 (31-40 years); 192 (41-50 years); 258 (51-60 years); 114 (61-70 years); and 19 (71-77 years)). The alternative states of low (blue) and 
high (red) abundances are here more clearly pronounced than in the population-level histogram pooled over all age groups (Fig. 1). 
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These groups exhibited robust bimodality in their abundance 
distributions, and decreased temporal stability at the intermediate 
abundance range. A bimodal abundance distribution associated 
with the B. fragilis group was also observed, but we could not 
confirm the bistability of this group in subsequent analyses. Our 
findings were supported by independent subsets of the data using 
different DNA extraction methods and geographical distribu- 
tions, and in additional data sets from dietary intervention trials, 
including 46 placebo-treated control subjects. Whereas the 
observed states were robust to short-term dietary differences, 
which could not explain the observed population-level bimodal 
patterns, this does not exclude the possibility that targeted dietary 
interventions could be potentially used to manipulate specific 
bimodal groups in a personalized manner; we did observe state 
shifts in individual subjects following dietary interventions, but 
these changes were masked by the overall within-state variation in 
the other subjects and did not reach statistical significance in our 
limited intervention data. The observed patterns were most 
pronounced in individual phylotypes and in specific subpopula- 
tions, such as different age groups, and linked to the overall 
ecosystem composition and host health. 


Discussion 

Our results demonstrate that despite the dominating nature of 
gradual variation in bacterial abundances in the intestinal 
ecosystem, specific bacterial groups form a limited number of 
contrasting, stable configurations that associate with host 
physiology and health. Bistability in bacterial abundances can 
be induced by many factors, such as disruptive selection>’, 
competition and cooperative feedback loops*4, — or 
environmentally or genetically determined host factors. Hence, 
we believe that the here identified bistable taxa, together with 
other microbial co-occurrence relationships*®>>*°, will be 
instrumental in providing insight on the variation, regulation 
and health implications of the intestinal microbiota. Targeting 
specific subpopulations, as opposed to the daunting complexity 
and variability of the entire ecosystem, can simplify the 
characterization and possible manipulation of the intestinal 
microbiota. The implicated taxa may allow stratification of the 
subjects based on alternative states that in parallel to gene 
richness”? represent promising novel targets for microbiome- 
based diagnostics and therapies*’. Moreover, resetting these 
‘bacterial dual in-line package (DIP) switches’ may be a radically 
new way to approach the rapidly growing number of health issues 
related to the intestinal microbiota, in analogy to the binary DIP 
switches from early computer era. Further intervention studies 
will be needed to address the causal relations with the overall 
community composition and host factors, and to investigate 
whether these alternative stable states translate into differential 
disease susceptibility or drug response of the host. 


Methods 


Sample collection. Phylogenetic microarray data from over 5,000 intestinal 
samples were available in the in-house MySQL database of the HITChip, a phy- 
logenetic microarray!*. To investigate the microbiota variability in western adult 
population, we selected from the HITChip database 1,006 western adults (18-77 
years; average BMI 26.7kgm~ 7; s.d. 5.8kgm~ 2). To control the potentially 
confounding effect of other environmental or host factors, we curated the available 
sample annotations and excluded subjects who had reported diseases or antibiotic 
treatments, or were living outside western countries (Europe/the United States), 
and used bootstrap analysis to verify that the results were robust to random 
sampling fluctuations as described below. A single faecal sample per subject was 
selected; the first time point was used if multiple samples were available. The 
subjects cover 15 western countries (Europe and the United States) that were 
further aggregated into Central European (Belgium, Denmark, Germany, the 
Netherlands), Eastern European (Poland), Nordic (Finland, Norway, Sweden), 
South European (France, Italy, Serbia, Spain), UK/Ireland (UK, Ireland) and the 
United States (the United States) groups. To our knowledge, this is the largest 


collection of phylogenetic microarray profiling data of the human intestinal 
microbiota. Due to the retrospective nature of this study, we did not select a 
particular sample size based on power calculations or related approaches or 
replicate the experiments but instead included all available data, and performed 
various statistical tests and verifications in independent subsets of the data to assess 
the significance and robustness of the findings. 


Microbiota profiling. Culture-independent techniques for characterizing intestinal 
microbiota include phylogenetic microarrays and sequencing-based approaches*®. 
Phylogenetic microarrays can be used to quantify with high accuracy the known 
(previously detected) bacteria including the low-abundance phylotypes that often 
remain undetectable with conventional sequencing depths*?. Our analysis is based 
on the phylogenetic HITChip microarray!*, produced by Agilent Technologies 
(Santa Clara, CA, USA), that provides a sensitive analysis platform to assess 
differences in relative abundance of intestinal bacteria. The HITChip microarray 
targets the V1 and V6 hypervariable regions of 16S rRNA gene, and can detect 
1,033 species-like bacterial phylotypes (> 98% sequence similarity in the 16S rRNA 
gene) that represent the majority of the bacterial diversity of the human intestine)’. 
For the analyses, hybridization signals were summarized to 130 genus-like 
phylogenetic groups (> 90% sequence similarity in the 16S rRNA gene) that are 
referred to as species and relatives, the latter being shortened as ‘et rel.’!*. The 
HITChip microarray allows highly reproducible (98 + 2% Pearson correlation 
across technical replicates) and deep analysis (reproducible detection of phylotype 
abundances below a 0.1% relative abundance level) of the phylotype composition of 
intestinal samples, comparable to 200,000 next-generation sequencing reads per 
sample>?. Hence, the standardized analytical pipeline for the HITChip microarray 
data collection provides advantage over sequencing-based approaches, where the 
comparable collections of intestinal microbiota profiling data are smaller, and 
integration of data from individual studies is more challenging, for example, due to 
the use of different primers, continuously changing sequencing platforms and 
protocols. The faecal samples, collected in the context of clinical trials, were 
typically taken at home, frozen at — 20 °C, delivered to the study centre within 24h 
and stored at — 80°C. The DNA was extracted and prepared by amplification of 
the full-length 16S rRNA gene, followed by transcription and labelling of the 
resultant RNA with Cy3 and Cy5 before fragmentation and hybridization on the 
array!*, The signal intensity data from the microarray hybridizations were collected 
using the G2505C scanner by Agilent and preprocessed using an in-house MySQL 
database and custom R scripts. Each scanner channel from the array was spatially 
normalized using polynomial regression, followed by outlier detection and filtering 
in each set of probes with a y? test. The intensities were sample-wise quantile 
normalized and averaged to a single value for each probe-sample pair. Each sample 
was hybridized on at least two replicates to ensure reproducibility. Between-sample 
normalization was performed at probe level with the min-max normalization using 
0.5% quantiles estimated from the data. Each targeted genus-like group is probed 
by multiple (median 19) oligos. Logarithmized (log,o) signal of the probes targeting 
the same phylotype were summarized with Robust Probabilistic Averaging*”*! 
(f{RPA; RPA R package). The log;9-transformed signals were used as a proxy for 
bacterial logarithmic abundance. As a proxy for relative abundance, we used the 
fraction of the total probe signal calculated for the probes targeting a given 
taxonomic group. 


Bacterial abundance types. We categorized the 130 genus-like groups into 
characteristic ‘abundance types’ to gain an overview of their overall population- 
level variability. The abundance types included rare taxa and prevalent taxa with 
symmetric, (left- and right-) skewed and bimodal distributions (Fig. 2; 
Supplementary Table 1). The bimodal taxa were identified as described below, 
including verification in multiple data sets. For the other genus-like groups, the 
categorization was based on the 401 unique subjects whose faecal samples were 
processed with the repeated bead beating protocol that is based on mechanical cell 
disruption method for DNA extraction, and that we have found to be superior to 
alternative methods in terms of DNA extraction efficiency’. Altogether, 60 
bacterial groups were abundant and prevalent, exceeding HITChip log), signal of 
3.0 in > 20% of the subjects; the remaining 70 groups were considered rare. The 34 
prevalent groups with symmetric log abundance (absolute skewness <0.5; e1071 
CRAN R package) exhibited relatively constant logarithmic abundances across 
subjects, fluctuating symmetrically around a mean value. The skewed types 
included both rare and prevalent bacteria with both left- and right-skewed types 
(skewness < — 0.5 and >0.5, respectively). 


Potential analysis. We used potential analysis as the primary state identification 
method. This approach is based on the theory of stochastic dynamical systems, and 
was recently applied to identify alternative stable states in climatology and forest 
ecosystems’>!4, In the following, we provide a brief summary of the underlying 
theory and assumptions. 

Let us assume an underlying stochastic system that has a potential function of 
the form 


dz = — U(z)dt+odw, (1) 


where U(z) is the potential function, z is the state variable (in our case, the microbe 
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log) abundance), o is the noise level and dW is a noise term (Wiener process). The 
minima of the potential function correspond to stable states of the system. The 
corresponding Fokker—Planck equation connects the potential of this model to the 
probability density of the state variable z. The potential U is given by: 


U = — a" log(pa)/2 (2) 


where pq is the empirically estimated probability density function of the state 
variable z. An example of the estimated probability density for the bimodal 
Dialister spp. in our data is shown in Supplementary Fig. 3. As our primary interest 
is in qualitative analysis of the potential function shape, we scaled the potential to 
the noise level (U/a7) (ref. 13). We estimated the probability density with Gaussian 
kernels (the R density function) and automated kernel width adjustment with 
Scott’s method*? (the bw function in the R stats package), giving the bandwidth 
h=1.06s/n"/>, where s is the s.d. of the data points and n is the sample size. We 
adjusted this bandwidth with a factor of 0.8 to achieve more sensitive detection of 
multimodality. We estimated the local minima and maxima numerically and 
reduced spurious findings by requiring that the difference between local minima 
and the closest maxima was larger than a threshold value!> (0.005 in our analysis) 
and with a minimum density of 0.1. We used the number of local minima of the 
potential function to quantify the number of distinct modes!*. 


Bimodality detection. We implemented the potential analysis as a state detection 
methodology!*!* in the earlywarnings R package (http://www.early-warning- 
signals.org/; v. 1.0.59; livpotential_ews function). To avoid spurious findings 
associated with random sampling fluctuations and to quantify the robustness of 
bimodal patterns, we repeated this analysis in 100 bootstrap data sets, and selected 
for further analysis the taxonomic groups that exhibited consistent evidence for 
multimodality in >90% of the bootstrap data sets (Supplementary Note 5). While 
the estimated number of modes in some cases varied between the bootstrap data 
sets, the evidence was the strongest for two alternative states in all multimodal 
groups. Hence, we used the overall fraction of bootstrap samples that indicated 
deviation from unimodality to quantify the evidence for bimodality. Our sample 
collection included samples from multiple studies extracted using different DNA 
extraction methods, which could affect bimodality detection in a taxon-specific 
manner’». To control this, we confirmed the bimodality of the reported six bimodal 
groups in the subset of 401 samples extracted using the mechanical lysis-based 
DNA extraction method (repeated bead beating)”. We also assessed bimodality in 
the subset of 287 samples processed using enzymatic lysis. The UCII group had a 
moderate (83%) bootstrap support for bimodality, whereas bimodality of the 

B. fragilis group was not supported in this smaller data set (31% bootstrap support). 
The bimodal patterns in Dialister spp., P. melaninogenica, P. oralis and UCI were 
verified in 100% of the bootstrap samples also in this data set. For the final analysis, 
we included the B. fragilis, Dialister spp., P. melaninogenica, P. oralis, UCI and 
UCI groups whose bimodality was detected in the overall data collection and in at 
least one of the two distinct DNA extraction method subsets with > 90% bootstrap 
support (Supplementary Note 6). An independent analysis of bimodality in these 
univariate observations of bacterial abundance was performed using the PSI (with 
M= 20 random splits into training and test sets; fpc R package) based on the 
partition around medoids clustering and Euclidean distance!®!°. 


Tipping point. The tipping point is here defined as the particular bacterial 
abundance value that maximizes the potential function of a bistable system; this 
corresponds to an unstable threshold between the contrasting alternative stable 
states, which correspond to the local minima of the potential function. The 
potential minima and maxima were determined by potential analysis. We used the 
average over the different bootstrap data sets as the final tipping point estimate; this 
provides increased robustness against random sampling fluctuations. 


Temporal analysis. To investigate the temporal stability of the observed alter- 
native states, we analysed 78 subjects from the main cohort with additional follow- 
up data from dietary intervention trials; 28% of the follow-up samples were subject 
to dietary intervention but no significant changes following the dietary intervention 
were seen in the bimodal taxa, except for the UCI and UCII groups (P< 0.05; 
t-test). The median age of the subjects was 49 years (27-63 years); the median BMI 
33 kgm 2 (18-43 kgm *); most subjects (1 = 65; 86%) were females from Fin- 
land (n= 43; 55%) and Belgium (n = 32; 41%); and 81% of the subjects had 2-3 
time points; the remaining 19% had 4-5 time points. Most (95%) time points were 
within 1-9 months from the baseline (mean 3.5 months), with the average follow- 
up interval (time between consecutive time points) of 2.4 months. Despite some 
variability between subjects, the follow-up times were within a similar time span 
and the differences in the follow-up times did not explain differences in the 
observed fluctuations between time points, as these two variables were not sig- 
nificantly correlated in our data (|p|<0.05; P>0.05; upper tail (AS89) test). We 
quantified the stability at the intermediate abundance range by calculating the 
correlation between the distance from the intermediate (50%) abundance quantile 
and the observed shift between the consecutive time points; negative correlations 
indicate instable regions where larger shifts in microbial abundance are observed 
on average. In addition, we approximated the state shift frequencies in subjects 
with varying follow-up times based on Kaplan-Meier survival analysis (the survival 


R package), which gives an estimate of the fraction of subjects remaining in their 
initial state as a function of time based on limited observations with varying follow- 
up times. We used the estimated fraction of subjects remaining in their original 
state at the 3-month time point to quantify state stability (Supplementary Note 7). 
Since the observed state shifts could arise from natural, continuous fluctuations in 
bacterial abundance rather than abrupt shifts between alternative stable states, we 
used the same procedure to estimate how often state shifts would be observed in 
the other prevalent taxa based on simulated states of low and high abundance 
based on the same abundance quantiles to determine the ‘tipping point’ between 
the two states, and compared the bistable groups with the other prevalent taxa 
(Supplementary Fig. 2b). 


Community-level analysis. We assessed the evidence for alternative states at the 
overall ecosystem level based on PCA (Supplementary Fig. 7a) and the partition 
around medoids multivariate clustering with Jensen—Shannon dissimilarity, cou- 
pled with the PSI to determine the optimal cluster number as this combination was 
earlier shown to have a good overall performance!” (Supplementary Note 8). In 
addition, we used principal coordinates analysis (that is, classical multidimensional 
scaling; cmdscale function of the R stats package)‘ as a complementary nonlinear 
approach for community-level visualization!® based on generalized UniFrac 
distances*® (GUniFrac function of the R GUniFrac package**) that take into 
account the phylogenetic structure of the community, quantified by HITChip 
probe-level analysis (Supplementary Fig. 7b). 


Short-term dietary interventions. Whereas dietary information was not available 
for our main data set, we analysed the effects of short-term dietary interventions in 
an additional data set from the HITChip database consisting of 138 subjects 
sampled during dietary intervention studies, including three previously pub- 
lished!®!7-49 and two unpublished dietary intervention trials: (i) 52 Finnish adults 
fulfilling the criteria for MetS that were subjected to qualitative changes in 
carbohydrate intake!*® (50% of the samples in a control group); (ii) 33 obese Belgian 
women from a prebiotic (inulin-type fructan) intervention study* (50% of the 
samples in a control group); (iii) 31 obese Finnish subjects with weight-loss diet 
(HITChip project ‘ISRCTN67529475’); (iv) 8 Norwegian subjects with a vegan diet 
(HITChip project “Vegan’); and (v) 14 obese British males with fibre-supplemented 
and weight-loss diets!’. In addition to the dietary intervention subjects, this data 
collection included 46 placebo-treated control subjects. 


Associations with host parameters. We quantified associations between the 
bimodal taxa and host factors by using subject metadata (age, BMI, sex, nationality) 
and DNA extraction method as fixed effects in multiple logistic regression to 
predict the state of the corresponding bacteria in each subject. The significance of 
each parameter was estimated by log-ratio (y7) test comparing the models with and 
without the corresponding factor in predicting the state (the GLM function in the R 
stats package). Missing annotations were omitted, and the FDR was estimated to 
correct for multiple testing based on the Benjamini-Hochberg method with the 
p-adjust function (R stats package). 


Health associations. To investigate the associations between the alternative states 
and health status, we compared the 1,006 adults with no reported health compli- 
cations with additional samples in the HITChip database representing western 
adults with irritable bowel syndrome (n= 106), MetS (n= 66), type II diabetes 
(n = 78), ulcerative colitis (n = 62) and cardiovascular disease (n = 45). In addition, 
we divided the non-compromised subjects into severe obese (BMI>35; n = 136) 
and other (BMI <35; n= 870) subjects. We used multiple logistic regression to 
predict disease occurrence from logarithmic abundance of each bimodal bacterial 
group. The DNA extraction method, age, BMI and sex were included in the model 
as potentially confounding variables. The P values from log-ratio test were cor- 
rected using the Benjamini-Hochberg procedure. For exploratory purposes, asso- 
ciations with FDR< 20% were considered to establish a trend and FDR<5% to be 
significant (Supplementary Table 4). We compared the performance of the logistic 
regression model with and without bacterial abundance based on receiver operator 
characteristic analysis to detect disease occurrence; the average area under curve 
(AUC) values were derived from fivefold cross-validation. While our analysis 
suggests significant association between MetS and specific taxa, the differences in 
the AUC values between the models with and without bacterial abundance were 
small (0.1 increase with bacterial abundance) in the MetS group, indicating limited 
predictive ability in our data to distinguish the effects of MetS from the effects 
of age, sex and BMI. In severe obesity, inclusion of bacterial abundance notably 
improved disease prediction (AUC 0.58 versus 0.69 for UCI; 0.60 versus 0.68 

for UCII). 
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Abstract | Despite an extensive body of reported information about peripheral and central mechanisms 
involved in the pathophysiology of IBS symptoms, no comprehensive disease model has emerged that would 
guide the development of novel, effective therapies. In this Review, we will first describe novel insights into 
some key components of brain—gut interactions, starting with the emerging findings of distinct functional and 
structural brain signatures of IBS. We will then point out emerging correlations between these brain networks 
and genomic, gastrointestinal, immune and gut-microbiome-related parameters. We will incorporate this new 
information, as well as the reported extensive literature on various peripheral mechanisms, into a systems- 
based disease model of IBS, and discuss the implications of such a model for improved understanding of the 
disorder, and for the development of more-effective treatment approaches in the future. 
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Introduction 

IBS is the most common functional gastrointestinal dis- 
order, occurring in up to 15% of the population world- 
wide.' Even though the syndrome is defined by chronically 
recurring abdominal pain and discomfort associated with 
altered bowel habits in the absence of organic disease, 
increased trait anxiety, as well as comorbidity with psy- 
chiatric and other chronic pain syndromes are common.’? 
Despie an extensive body of reported information about 
peripheral* and central®’ mechanisms involved in the 
pathophysiology of IBS symptoms, and the development 
of animal models with high face and construct validity 
(that is, the models have many features similar to and 
is based on a similar pathophysiological concept as the 
human disease),'’ no comprehensive disease model has 
emerged that would guide the development of novel and 
effective therapies. This aspect is surprising in view of the 
comprehensive data as well as clinical experience demon- 
strating the strong relationship between psychosocial 
factors and IBS symptoms,” and in view of breakthroughs 
in the identification of several IBS-related biological 
abnormalities at several levels: the gut epithelium;” 
immune system;*° neuroendocrine mechanisms;'* brain 
structure and function;’ stress response;" affective,*" 
cognitive®'*'* and pain modulation’?”® abnormali- 
ties; gene polymorphisms;? and the gut microbiome.*?” 
Despite these insights, a comprehensive understanding 
of how these various factors interact, and particularly to 
what degree they are involved, in the generation of symp- 
toms in IBS in general or in IBS subsets (as opposed to 
representing epiphenomena) has not emerged. The long 
history of IBS research”? is full of examples of reported 
abnormalities (including excessive mucus production, 
alterations in sigmoid colon motility, slow wave abnor- 
malities in smooth muscle, gut inflammation and others), 


Competing interests 
The authors declare no competing interests. 


which were reported in small sample cohorts and often 
not confirmed in validation sample cohorts. Typically, 
the majority of reported mean differences are small when 
compared with healthy individuals, often do not take into 
account sex-related differences, might only be present in 
subsets of patients and correlations with clinical symptoms 
are weak. Thus, only a very small number of findings have 
translated into highly effective therapies.” These therapies 
include the serotonin (5-HT) 5-HT, receptor antagonist, 
alosetron, the 5-HT, agonist, tegaserod, the guanylate 
cyclase agonist, linaclotide, and the chloride channel 
blocker lubiprostone (for chronic idiopathic consti- 
pation).” Even though effective, the first two agents were 
restricted or withdrawn due to serious adverse effects, 
whereas the latter two are primarily targeted at treating 
the symptom of constipation.” 

The brain and gut show reciprocal interactions in health 
and disease. For example, altered brain outputs via the 
autonomic nervous system (ANS) and the hypothalamic- 
pituitary—adrenal (HPA) axis have been shown to influ- 
ence intestinal motility and secretion,” intestinal epithelial 
permeability,’'*”” immune function** and gut microbial 
composition.”' In addition to these well-established influ- 
ences of the brain on peripheral target cells, peripheral 
environmental gut-directed factors—in particular, dietary 
factors” and intestinal pathogens*’—might have an 
equally important role on these same processes. Regardless 
of primary cause for the observed peripheral findings, 
several of them (in particular immune and microbiota- 
related signalling) can feed back to the brain, setting up 
circular regulatory loops (Figure 1). 

Major barriers exist for progress towards a comprehen- 
sive understanding of IBS pathophysiology, which incor- 
porates such circular regulatory loops. Despite a wealth 
of information supporting all mechanisms listed already, 
controversy regarding the primary role of the brain versus 
peripheral factors has persisted in the field. The majority 
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Key points 


» Physiological and molecular alterations have been identified in the brain—gut 
axis of human and rodent models of IBS, yet a comprehensive disease model 
to guide effective drug development has not emerged 

» Studies have identified distinct brain signatures in patients with IBS, which 
provide plausible neurobiological substrates of many previously reported 
behavioural and psychosocial observations 

» Emerging evidence demonstrates correlations of these brain signatures with 
alterations in genetics, immune system and gut microbiota in IBS, even though 
the causality of these interactions remains unknown 

» Asystems-biology-based model is proposed to integrate the growing number 
of central, peripheral and behavioural IBS-related alterations, and to identify 
targets for more effective therapies 
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Figure 1 | Brain-gut axis. Schematic of the brain-gut axis, including inputs from 

the gut microbiota, the ENS, the immune system and the external environment. The 
model includes both peripheral and central components, which are in bidirectional 
interactions. Bottom-up influences are shown on the right side, top-down influences 
on the left side of the graph. Abbreviations: ENS, enteric nervous system; HPA, 
hypothalamic—pituitary—adrenal; PBMC, peripheral blood mononuclear cell; SNS, 
sympathetic nervous system. Modified with permission from Nature Publishing 
Group © Irwin, M.R. & Cole, S.W. Nat. Rev. Immunol. 11, 625-632 (2011).*° 


of research and drug development have focused on single, 
usually peripheral, targets in preclinical models (such as 
ion channels or specific receptors on individual neurons). 
Although such studies have provided major insights 
into mechanisms of visceral pain,'’ their relevance for 
IBS symptom generation in humans has not firmly been 
established. Integration of multiple clinical, psychosocial 
and biological (genetic, immune, neurobiological) find- 
ings into a comprehensive disease model has yet to be 
achieved. For example, few studies aimed at psychosocial 
aspects of IBS have taken neurobiological concepts into 
account. The focus has been on descriptive, symptom- 
based rather than biology-based disease definitions. 


REVIEWS 


For example, different biological mechanisms might be 
associated with similar clinical presentations. The com- 
prehensive identification of distinct biology-based sub- 
groups of patients (including those based on sex), with 
different underlying pathophysiological components, who 
are differentially responsive to specific therapies has also 
not been achieved. A good example illustrating this point 
is a report on subsets of patients with IBS based on gut 
microbial signatures.*! 

In this Review, we will first describe novel insights 
into some key components of brain-gut-microbiome 
axis, starting with evolving concepts about alterations in 
defined structural and functional brain networks. We will 
discuss emerging evidence on how these brain network 
alterations are correlated with the immune system and 
the gut microbiota. We will then incorporate this infor- 
mation, as well as the reported extensive literature on 
various peripheral mechanisms, into a systems-based 
disease model of IBS. Rather than simply synthesizing 
the current knowledge about the disease, we will use this 
model to discuss the implications for better understanding 
of the disorder and for the development of more-effective 
treatment approaches in the future. 


The nervous system component 

Several clinical observations support a major role of the 
brain in IBS symptoms: the brain is ultimately responsible 
for generating the subjective experience of abdominal 
pain, discomfort and anxiety; stressful life events in early 
life have a major role in vulnerability to develop IBS, and 
psychosocial stressors in adulthood play crucial parts in the 
first onset, and perceived severity of symptoms;” centrally 
targeted pharmacological treatments and cognitive behav- 
ioural strategies are some of the most-effective treatment 
strategies.*’ The role of the enteric nervous system (ENS) 
in the regulation and coordination of motor and secretory 
functions of the gastrointestinal tract, in sensory function 
and its interactions with the brain have been extensively 
reviewed.***? Consistent with the theme of this Review, 
the complexity of the interactions between multiple gut- 
based cell types (intrinsic and extrinsic sensory neurons, 
enteric glia, immune cells and innervated enteroendocrine 
cells) has been referred to as the ‘gut connectome.** The 
interactions between the ENS and central nervous system 
(CNS), two key components of the brain-gut axis, are 
mediated by the sympathetic and parasympathetic 
branches of the ANS,° and by multiple sensory and endo- 
crine pathways.** Although it is difficult to characterize 
alterations of the ENS or of ANS-mediated brain-gut con- 
nections in living humans directly, functional, structural 
and metabolic brain imaging approaches have become a 
highly productive avenue to study brain-gut-microbiota 
interactions in health and disease.*”*”** 

Although it has long been assumed that specific brain 
functions such as pain processing, emotion or cogni- 
tion are attributable to the isolated operations of single 
brain regions, these processes are now viewed as result- 
ing from the dynamic interactions of distributed brain 
areas operating in large-scale networks (Figure 2). These 
networks and their properties have been assessed by 
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Figure 2 | Brain networks contributing to IBS symptoms. Depicted are task-related brain networks that have been described 
in the literature and for which structural and functional alterations have been reported in patients wiht IBS. The box in the 
centre describes the clinical symptoms related to the network inputs. Outputs most relevant for IBS pathophysiology 

occur in the form of descending pain modulation and autonomic nervous system activity. Abbreviations: Amyg, amygdala; 
alNS, anterior insula; aMCC, anterior midcingulate cortex; BG, basal ganglia; dIPFC, dorsolateral prefrontal cortex; 

Hipp, hippocampus; Hypo, hypothalamus; LCC, locus coeruleus complex; M1, primary motor cortex; M2, supplementary 
motor cortex; mPFC, medial prefrontal cortex; NTS, solitary nucleus; OFC, orbitofrontal cortex; PAG, periaqueductal grey; 
pgACC, pregenual anterior cingulate cortex; pINS, posteria insula; PPC, posterior parietal cortex; sgACC, subgenual anterior 
cingulate cortex; Thal, thalamus; vIPFC, ventrolateral prefrontal cortex. 


using neuroanatomical and neurophysiological studies 
in animals,'’ as well as different brain imaging tech- 
niques and analyses in humans.*”*° In humans, several 
types of networks have been reported: functional brain 
networks based on evoked responses” or intrinsic con- 
nectivity of the brain during rest;*°*!** structural net- 
works based on grey matter parameters*’ and white 
matter properties; and anatomical networks based on 
white matter connectivity.** Both evoked and resting 
state studies performed in patients with IBS have dem- 
onstrated abnormalities in regions and task-related net- 
works linked to emotional arousal,*”-°’ central autonomic 
control,®°*°° central executive control,’!””°* sensorimotor 
processing®°’*! and salience detection.*”” IBS-related 
alterations in these networks have provided plausible 
neurobiological substrates for several information pro- 
cessing abnormalities reported in patients with IBS, such 
as biased threat appraisal and expectancy of outcomes 
(for example, salience network), autonomic hyperarousal 
(emotional arousal and central autonomic networks), and 
symptom-focused attention (central executive network).° 

In the next section, we will discuss IBS-related changes 
that have been identified in these task-related networks. 
The individual networks are depicted in Figure 2, and their 
basic properties are described in Table 1. 


Emotional arousal network 

This network acts as an important link between stimulus 
appraisal (salience network) and ANS output (generated 
in the central autonomic network) to peripheral targets 
(gastrointestinal tract, gut microbes, immune system), 
thereby having an important role in determining the 
magnitude and duration of autonomic modulation of 
various gut functions. A reduction in the inhibitory feed- 
back loop within the arousal network has been demon- 
strated in patients with IBS compared with healthy 
individuals,'*® and in healthy individuals as controls 
after decreasing central serotonin levels by acute trypto- 
phan depletion.” Several studies published during the 
past decade support an increased responsiveness of emo- 
tional arousal circuits in relation to both expected and 
to delivered visceral stimuli, particularly in women.*”” 
A meta-analyses of functional MRI studies published 
between 2000 and 2010 demonstrated that during con- 
trolled rectal distension, patients with IBS show more 
consistent activation in regions associated with emo- 
tional arousal than healthy individuals.°> Emotional 
arousal circuit reactivity is associated with 5-HT-related 
gene polymorphisms.” IBS-related functional alterations 
are accompanied by structural brain alterations in key 
regions of this network.” 
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Table 1 | Brain networks with relevance to IBS 
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Network Core regions and inputs Key features 

Default Medial prefrontal cortex, posterior cingulate Comprised of brain regions whose activity is greater during rest 

mode or retrosplenial cortex, inferior parietal than goal-directed task performance*** 

network cortex, lateral temporal cortex and Associated with self-related processing, including monitoring 
hippocampal formation*”°**° internal thoughts and future planning?7”2182183 

Emotional Amygdala, its positive connections with the Activated by perceived or real perturbation of the 

arousal locus coeruleus complex and inhibitory organism’s homeostasis 

network feedback projections to the amygdala from Generates rapid feedback inhibition of amygdala,18+1® thereby 
prefrontal and anterior cingulate subregions, limiting the magnitude and duration of network activity and 
and from the hippocampus related activity in the central autonomic network 

Central Control centres in the pontine-medulla Regions related to SNS control overlap with the executive- 

autonomic (including PAG and hypothalamus), the central processing and salience-processing networks (including the 

network nucleus of the amygdala, and several cortical ventral anterior INS), whereas regions related to 


regions (including the anterior INS, ACC, 
prefrontal and motor regions)*8’188 


Sensorimotor Core cortical regions are primary somatosensory 


network cortex (S1, post central gyrus), primary motor 
cortex (M1; precentral gyrus), secondary 
somatosensory cortex (S2) and supplemental 
motor areat®? 
Close connections exist between the posterior 
INS (primary interoceptive cortex) and S1 
Sensorimotor network connectivity to the 
thalamus, which relays peripheral sensory 
information to the cortex,*% is established 
by 2 years of age*%t 

Central Lateral prefrontal cortices and posterior 

executive parietal cortex?93 

network 

Salience Dorsal ACC and anterior INS 

network 


Core regions have strong connections to 
medial prefrontal and temporal regions, and 
subcortical regions including the amygdala, 
PAG and basal gangliat?* 19° 


Dorsal portions of the anterior INS receives 
prefrontal input, whilst ventral portions are 
closely linked with the amygdala and emotional 
arousal system 


parasympathetic control are more associated with the default 
mode network*> 


Provides central control and modulation of the ANS 


Involved in regulating respiratory, cardiovascular, endocrine and 
digestive system activity during cognitive, affective and motor 
tasks and sensation 


Receives sensory input from the periphery and is important for 
awareness of body sensations and generation of appropriate 
motor responses 


Primary and secondary motor cortex, through their projections to 
the central autonomic network, might have a modulatory role in 
the sympathetic control of visceral function 


Activated during tasks involving executive functions such as 
attention, working memory, planning and response selection 


Often coactivated with regions of the salience network,°° as the 
brain attempts to focus its limited processing capacity to only 
salient information via attention, working memory, planning and 
response selection1% 


Anterior INS can be considered the main hub in the brain, 
switching from default mode network to activity-related networks, 
and coordinating and adjusting bodily and behavioural responses 
to environmental changes 


Responds to subjective salience of any interoceptive and 
exteroceptive stimulus reaching the brain, or to the expectation 
of such stimuli, and coordinates appropriate attentional, 
behavioural, affective and visceral responses to such stimuli??19” 


Responds with the most appropriate responses to biologically 
and cognitively relevant stimuli based on maintaining 
homeostasis*??*% regardless of whether the subject is awake, 
engaged in a particular task, or asleep (achieved through close 
salience network connections with the other networks) 


Abbreviations: ACC, anterior cingulate cortex; ANS, autonomic nervous system; INS, insula; PAG, periaqueductal grey; SNS, sympathetic nervous system. 


Central autonomic network 

An extensive body of knowledge derived from rodent 
studies**””-” supports an upregulation of brain circuits in 
response to chronic stress, in particular the input from 
the locus coeruleus complex to the amygdala and the 
hypothalamus. Information from human brain imaging 
studies has been more limited due to the technical diffi- 
culties in studying key regions of this network, such as the 
hypothalamus, locus coeruleus complex and subnuclei of 
the amygdala, primarily due to limited spatial resolution. 
However, pharmacological brain imaging studies have 
implicated alterations in the corticotropin-releasing factor 


(also known as corticoliberin)-corticotropin-releasing 
receptor 1° and norepinephrine-a adrenergic receptor 
signalling system in this network.™ 


Sensorimotor network 

As with other chronic pain disorders,*-** evidence indi- 
cates that IBS might be associated with alterations in brain 
networks concerned with the central processing and 
modulation of viscerosensory and somatosensory infor- 
mation. For example, compared with healthy individuals, 
patients with IBS showed increased low frequency power 
of spontaneous brain oscillations (suggesting increased 
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neural activity) in regions belonging to the sensorimotor 
network.” These functional changes seem to be accom- 
panied by structural changes in white and grey matter. 
For example, patients with IBS had widespread micro- 
structural white matter abnormalities in sensory process- 
ing and/or modulation regions.” Female patients with IBS 
have cortical thickness increases in sensorimotor areas 
that correlated with clinical measures of symptom sever- 
ity compared with healthy individuals.” Such grey matter 
increases in patients with IBS were also seen in the poster- 
ior insula (INS), the primary viscerosensory cortex, and 
these changes were correlated with IBS symptom dura- 
tion.”' Volumetric grey matter analyses in a large sample of 
female patients with IBS revealed increases in the primary 
somatosensory cortex.”* Furthermore, examining struc- 
tural networks in IBS indicated that two key regions of 
the network (cingulate gyrus and thalamus) were found 
to be network hubs, indicating that these regions are 
more critical for information flow in IBS compared with 
health.” When viewed together, current evidence supports 
the hypothesis that patients with chronically recurring vis- 
ceral pain and/or discomfort have functional as well as 
neuroplastic and microstructural alterations within the 
brain, particularly in regions associated with the process- 
ing, integration and modulation of sensory information. 
The mechanism(s) underlying these alterations include 
chronically increased viscerosensory information flow 
from the gut, or from dorsal horn neurons sensitized by 
descending pain facilitation (see Figure 3). 


Central executive network 

Evidence indicates that patients with IBS might have func- 
tional impairments in cognitive processes associated with 
the central executive network.'*!” Preliminary evidence 
based on administration of the Attentional Network Test” 
suggests that patients with IBS have greater behavioural 
efficiency during the alerting and orienting function of 
attention than healthy individuals. In comparison with 
healthy individuals, this greater efficiency was associ- 
ated with greater activation of anterior midcingulate and 
insular cortices, confirming the previously reported close 
interactions between the central executive network and 
the salience network.*’ Converging evidence also sug- 
gests that increased attention to gastrointestinal symp- 
toms and contexts have an important role in the increased 
perceptual sensitivity to visceral stimuli characteristic 
of IBS.”° Patients with IBS show deficient activation of 
inhibitory cortical regions involved in downregulation 
of pain and emotion as well as attention during expecta- 
tion and experience of aversive gastrointestinal stimuli.*° 
Selective recall of negative and gastrointestinal sensa- 
tion words, as well as selective attention to threat-related 
stimuli, has been demonstrated in patients with IBS.°**° 
Furthermore, a reduction in the effective connectivity of 
the central executive network circuitry (including par- 
ietal, dorsal lateral prefrontal cortex) during repeated 
exposure to the anticipation and experience of a threat- 
ening gastrointestinal stimulus (repeated exposure to 
balloon inflations) was associated with a reduction in IBS 
hypersensitivity."” Data from a sample of Japanese patients 


with IBS, compared with healthy controls, indicated that 
alterations in error feedback mechanisms were associated 
with decreased dorsolateral prefrontal cortex activity.’® 
A strong negative correlation between the cortical thick- 
ness and grey matter density of the dorsolateral prefron- 
tal cortex and pain catastrophizing has been reported.”*” 
Evidence from the University of California, Los Angeles 
research group indicates prepulse inhibition, a process by 
which an organism can filter the flow of information from 
its internal and external environments, is altered in IBS 
compared with health. Together, these data suggest that 
patients with IBS have specific abnormalities in attentional 
processes that have a role in the increased perception of 
visceral stimuli and in IBS symptom severity. 


Salience network 

Studies performed during the past decade on brain 
responses to delivered and expected rectal distension 
have consistently reported increased engagement of the 
core regions of the salience network, the anterior INS 
and anterior midcingulate cortex in patients with IBS,”*” 
which initially but incorrectly were referred to as the 
“pain matrix”. In addition, a close relationship between 
increased affect, central emotional arousal processes and 
enhanced visceral stimulus perception has been reported 
in patients with IBS.°>'!°! Three recent reports published 
between 2013 and 2015 in female patients with IBS have 
identified disease-related alterations in anterior INS activ- 
ity and connectivity in the resting state**”’ and during an 
ambiguous abdominal pain threat,’ confirming a key role 
of salience network alterations in IBS. Reported altera- 
tions in the response and connectivity within the salience 
network are consistent with the prediction error character- 
istic of patients with IBS about the likelihood and severity 
of future gastrointestinal symptoms (catastrophizing)." 


Integration of brain networks 

An extensive body of literature supports the model of the 
central role of aberrant salience computation underlying 
key clinical IBS symptoms (depicted in Figure 3). Well- 
established biological and behavioural consequences 
exist with such a biased appraisal: the engagement of 
altered ANS outputs to targets in the gastrointestinal tract 
(including ENS activity, gut permeability, gastrointestinal 
motility and secretion,’ gut microbial composition and 
metabolites),”’” and to extraintestinal targets (including 
the immune system);'” shifting of the balance of endoge- 
nous pain modulation systems towards increased descend- 
ing pain facilitation;” increased engagement of the central 
executive network resulting in selective attention to gastro- 
intestinal symptoms; and development of prediction errors 
about likelihood and severity of symptoms (so-called 
catastrophizing).'°*'” The model also provides a plausible 
biological basis for the effectiveness of different behav- 
ioural interventions such as cognitive behavioural therapy 
(normalizing salience, executive control and emotional 
arousal networks), self-relaxation techniques (normalizing 
emotional arousal and central autonomic networks) and 
mindfulness-based stress reduction (normalizing salience 
and executive control networks).° 
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Figure 3 | Cross-sectional integrated brain—gut model of IBS pathophysiology. Proposed model for involvement of brain-gut 
axis in the generation of cardinal IBS symptoms (chronic abdominal pain associated with altered bowel habits). Under 
normal circumstances, visceral and external signals are evaluated by the salience network, which generates brain outputs 
in terms of targeted ANS responses (regulating gastrointestinal and immune function) and descending pain modulatory 
activity (regulating pain sensitivity at the dorsal horn level). Target organ alterations (either peripherally or ANS stimulated) 
are signalled back to the brain via neural, endocrine or immune-related channels. These signals are processed within 
subregions of the INS, and depending on their subjective salience, are consciously perceived (associated with activation 
of anterior INS) as normal gut sensations, discomfort or pain. IBS symptoms can arise from several primary peripheral or 
central mechanisms, but once brain—gut interactions are altered, causality is difficult to determine. Abbreviations: Amyg, 
amygdala; ANS, autonomic nervous system; dACC, dorsal anterior cingulate cortex; dIPFC, dorsolateral prefrontal cortex; 
Hypo, hypothalamus; INS, insula; orbFC, orbitofrontal cortex; PAG, periaqueductal grey; rACC, rostral anterior cingulate 


cortex; RVM, rostral ventromedial medulla. 


Genetics and epigenetics 

As with many other chronic diseases that involve the 
brain, IBS probably has a strong developmental compo- 
nent, which starts with the interactions of genetic and epi- 
genetic factors early in life, including the prenatal period. 
In addition to the reported associations of gene variations 
with various peripheral mechanisms,'” imaging genetics 
studies performed in large samples of well-phenotyped 
individuals with and without IBS have identified inter- 
actions between early environmental factors,'” candidate 
gene polymorphisms and brain networks related to emo- 
tional arousal and/or central autonomic control, salience 
and somatosensory integration. The reported genes were 
related to the regulation of the HPA axis: corticotropin- 
releasing hormone receptor 1 (CRHRI, single nucleo- 
tide polymorphisms [SNPs] rs7209436, rs110402 and 
18242924); glucocorticoid receptor gene (NR3C1, SNPs 
182963155 and rs33389);!°%! female sex hormones 
(progesterone receptor or PGR, SNPs rs1042838 and 
1s10895068);'” 5-HT signalling system (HTR3A c.-42C>T 
SNP rs1062613),” inflammation-related genes (IL1B, SNP 
1s16944);'°§ and catecholaminergic signalling (ADRAID, 
SNP rs1556832; ADRA2B, SNP rs1042717; COMT, SNP 
1s174697).'° As discussed in the next section, these inter- 
actions are markedly influenced by epigenetic factors!!!” 


(such as a history of early adverse life events [EALs]) and 
by the sex of the study participant (Figure 4). 

As with other polygenic disorders, it has become clear 
that no single gene variation is sufficient to explain the 
full clinical phenotype in IBS. However, interactions 
between multiple genes, early life experiences and sex 
probably make a small contribution to the overall vari- 
ance of the peripheral and central endophenotypes.'” 
Validation studies in larger samples are required to 
confirm such contributions. 


The external environment component 

Various influences originating in the external environ- 
ment (Figure 1) can markedly affect the development, 
chronicity and severity of IBS. Although some of these 
influences are mediated by the brain (psychosocial 
stress, social support, societal responses to symptoms) 
others are mediated by the gastrointestinal tract, includ- 
ing the gut microbiota, diet, gastrointestinal infections 
and medications. 


Brain-mediated influences 

Clinical literature on the role of psychosocial factors in the 
development, symptom persistence and symptom flares 
in IBS is extensive.'® In addition to the well-documented 
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Figure 4 | Longitudinal brain-gut model of IBS pathophysiology. The interaction between genetic and epigenetic influences 
result in central and peripheral gene expression profiles that underlie the shaping of brain-based and gut-based 
intermediate phenotypes. Epigenetic factors provide the input from environmental influences on the development 

of intermediate phenotypes. Brain and gut intermediate phenotypes interact bidirectionally to shape the clinical 
phenotype of IBS. Abbreviations: AM, amygdala; HF, hippocampal formation; OFC, orbitofrontal cortex; PFC, prefrontal 


cortex; sACC, subgenual anterior cingulate cortex. 


role of stressful life events in adults in preceding or 
exacerbating IBS symptoms,” a history of EALs is associ- 
ated with an increased vulnerability to IBS," disorders of 
mood and affect,''*!"° as well as to a wide range of other 
chronic diseases.''® In contrast to the earlier emphasis 
on the role of a sexual abuse history, it is now clear that 
a variety of factors that disturb the quality of the inter- 
actions between the primary care giver and the child 
during the first 18 years of life (including serious illness 
of the mother, marital discord, divorce, verbal and emo- 
tional abuse) can have equally detrimental effect on adult 
disease vulnerability.""° 

Evidence from brain imaging studies have identified 
structural and functional brain alterations associated with 
self-reports of EALs. Structural alterations in regions of 
the emotional arousal circuitry”® and in regions associ- 
ated with modulating somatosensory and viscerosensory 
processes” were correlated with such reports. Alterations 
in the activity in the brainstem and amygdala in response 
to noradrenergic stimulation has been associated with 
increasing levels of EALs, consistent with an upregulation 
of central autonomic circuits in IBS."” Self-reports of EALs 
were also found to be correlated with brain activity in net- 
works involved in determining the salience of somatic, 
visceral or environmental stimuli in IBS.°’ These results 
suggest that the experience of adversity early in life can 
lead to altered resting state activity in the salience and in 
the central executive network of adults with IBS, possibly 
leading to permanent alterations in salience computation 


of viscerosensory signals by the brain. The observed 
alterations in the functional connectivity of the emotional 
arousal”! and salience networks”’ found in IBS might be 
driven by altered central noradrenergic modulation.*°™ 
Similar changes have also been observed in animals 
exposed to early life stressors,''® and have been linked to 
increased sympathetic nervous system responses.'!? EALs 
are also associated with altered signalling within the HPA 
axis.'*°!7! A study published in 2009 demonstrated that 
self-report of EALs was associated with exaggerated HPA 
axis responses to an aversive visceral stimulus, an effect 
that was more pronounced in male participants.'” 

Considerable preclinical and clinical evidence supports 
the concept that gene expression can be influenced by 
EALs through epigenetic mechanisms, including DNA 
methylation, and that these effects can persist throughout 
adult life."'>!°?!"? As depicted in Figure 4, interactions of 
EALs, sex and vulnerability gene polymorphisms might 
increase the risk of developing IBS by shaping the connec- 
tivity of relevant brain networks (see previous section). 
Even though the prevalence, clinical importance and 
underlying molecular mechanisms of EALs have been 
studied in great detail, there are other factors through 
which the external environment can influence brain func- 
tion. These include, but are not limited to, the beneficial 
role of a strong social support system, which can miti- 
gate the negative effects of EALs, and societal responses 
to patients’ symptom reporting, which can drive a vicious 
cycle of symptom amplification.’ 


598 | OCTOBER 2015 | VOLUME 12 


www.nature.com/nrgastro 


© 2015 Macmillan Publishers Limited. All rights reserved 


Gastrointestinal-tract-mediated influences 

Factors arising from the external environment have been 
implicated in the pathophysiology of IBS and in the modu- 
lation of IBS symptoms. These factors have been reviewed 
elsewhere and so will not be discussed in detail here: 
dietary factors;'”°"'** pathogenic microorganisms;’”*!*° 
and antibiotic treatment.'*'!** It remains to be determined 
if patients with IBS have abnormal mucosal responses to 
any of these factors, if symptoms reported in relation to 
these factors are mediated by alterations in the gut micro- 
biota (see section on gut microbiota), or if it is simply the 
sensitivity of visceral perception that determines if some- 
body develops symptoms or can tolerate the same factors 
without any symptoms. 


The immune system component 

IBS is not an inflammatory disease, but a growing body of 
research suggests that dysregulation in immune function 
might nevertheless contribute to its aetiology or symp- 
toms.*>!*? Mixed data exist on whether plasma or intestinal 
mucosal cytokine levels are associated with IBS.’ However, 
other research has linked IBS to an increased reactivity of 
blood monocytes and increased numbers of mucosal mast 
cells.’*4"° Some studies also indicate that peripheral blood 
mononuclear cells (PBMCs) from patients with IBS show 
abnormal release of proinflammatory cytokines such as 
IL-6, IL-1b and TNF” 

These observations in IBS are consistent with the 
broader role of the nervous system in regulating immune 
cell development and gene expression via neuro- 
endocrine signals from the brain (for example, corti- 
sol from the HPA) and activation of sympathetic nerve 
fibres in the one marrow and other lymphoid organs 
(for example, via the sympathetic neuroeffector mol- 
ecule norepinephrine).'”'*°"'” These regulatory interac- 
tions enable the CNS to integrate information regarding 
the general internal state of the body with information 
regarding real and perceived environmental threats (as 
detected by the salience network) and historical or devel- 
opmental influences (for example, a history of EALs'”’). 
As one example of these regulatory effects, studies have 
found that stressful life circumstances are associated 
with the activation of a so-called conserved transcrip- 
tional response to adversity (CTRA) in PBMCs that is 
characterized by increased expression of proinflamma- 
tory genes and decreased expression of genes involved 
in innate antiviral responses (for example, type I inter- 
ferons) and IgG antibody production.'°*!*°!*4!* CTRA 
gene expression can also be experimentally invoked by 
social stress in animal models” and is mediated in part 
by sympathetic nervous system (SNS)-induced increases 
in bone marrow haematopoietic production of imma- 
ture and immunologically primed monocytes (CD16 in 
humans, Ly-6c™ in mice).'“° These primed monocytes can 
also be reciprocally recruited into the CNS by exposure of 
mice to social threat and aggression.” The integration 
of these observations suggests a new hypothesis regard- 
ing the immune system’ role in the pathogenesis of IBS: 
high levels of SNS activity during early developmental 
periods (stemming from either genetic or environmental 
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triggers such as EALs) might lead to increased production 
of immature primed monocytes that both traffic into the 
gut to alter local function and ENS plasticity, and traffic 
into the brain to affect CNS plasticity (including struc- 
tures involved in salience processing and autonomic 
regulation). The result can be viewed as a physiological 
Hebbian association between gut biology and brain func- 
tion, resulting in a self-perpetuating feedback system in 
which a sensitized gut generates ongoing adverse sensory 
experiences (symptom flares), to which a neurally sen- 
sitized brain responds with both greater aversion and 
increased sympathetic outflow, resulting in upregulated 
monocyte production that further promotes neural 
alterations in both the gut and the brain. 

Several areas of empirical evidence are consistent with 
this systems-level brain-immune-gut hypothesis, inclu- 
ding: brain regulation of primed monocyte production 
via the SNS; stress-induced migration of primed mono- 
cytes to the brain;'” inflammation-induced neuroplastic 
changes in the brain'” (for example, altering affective 
behaviour'*’); reinstatement of previous stress effect on 
brain and behaviour by subsequent exposure to mild stress 
weeks after the initial sensitization;'** and the observa- 
tion that patients with IBS (and animal models of EAL) 
show increased stress responsiveness (for example, SNS 
activity'°’) and increased responsiveness of brain cir- 
cuits related to salience detection, emotional arousal and 
autonomic response.'°)!>* 

Additional evidence was examined from a pilot study 
in which the PBMC gene expression profiles from 
20 patients with IBS (12 female) and 20 healthy indi- 
viduals (nine female).'** Analyses identified 280 gene 
transcripts showing >10% differential expression across 
groups (134 genes upregulated in PBMC from IBS, and 
146 downregulated). Promoter-based bioinformatics 
analysis implicated several transcription control path- 
ways in structuring the observed transcriptome differ- 
ences, including increased activity of CREB transcription 
factors (which mediate B-adrenergic signalling from the 
SNS), growth control pathways (for example, the MAPK- 
responsive transcription factor ELK1), oxidative stress 
response pathways (NRF2), and pathways involved in 
growth factor and cytokine signalling (STAT). However, 
PBMCs derived from patients with IBS did not differ 
from those from healthy individuals in the activity of 
proinflammatory transcription factors such as NF«B 
or AP-1. Transcript origin analyses'™ indicated that IBS 
upregulated genes derived predominately from mono- 
cytes and dendritic cells. Additional transcriptome repre- 
sentation analyses suggest that these effects stemmed at 
least in part from upregulation of immature (CD16) 
monocytes within the PBMC pool of patients with IBS. 
These results are all consistent with a pattern of increased 
myeloid lineage cell development in patients with IBS, 
which might stem from tonically increased SNS signalling 
to bone marrow myelopoietic processes.'*° 

Consistent with the hypothesis that CNS processes 
might mediate relationships between IBS and periph- 
eral myelopoiesis, differential signalling by the myeloid 
lineage transcription factor MZF-1 was positively 
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associated with the morphometry of brain regions of the 
emotional arousal network’ (previously reported to be 
altered in IBS7*?°!°), These associations included bilat- 
eral amygdala, hippocampal and anterior INS volumes, 
and bilateral anterior INS cortical thickness (all P<0.05). 
Moreover, these associations were reasonably specific 
in that other brain regions showed either no associ- 
ation, or negative associations (for example, bilateral 
cerebellar volumes). 

In conjunction with the published data reviewed 
here, these preliminary results suggest that key regions 
of the emotional arousal and/or central autonomic net- 
works, which differ structurally between IBS and healthy 
individuals are also related to differential gene regula- 
tion in the peripheral immune system. We hypothesize 
that these chronic influences of the brain on peripheral 
monocytes (‘top dowr’) might have a role in the reported 
PBMC abnormalities in IBS, whereas primed monocytes 
migrating to the brain might have a role in the (‘bottom 
up’) generation of visceral hypersensitivity, anxiety and 
neuroplasticity during recurrent stressors. 


The gut microbiota component 
Evidence from rodent studies supporting bidirectional 
interactions between the gut microbiota and the nervous 
system (both CNS and ENS) has been summarized in 
numerous review articles.'°” © Even though various sig- 
nalling mechanisms underlying such interactions have 
been proposed, detailed mechanistic studies regarding the 
relative contributions of neural, hormonal, metabolite or 
immune-mediated factors are required to draw definitive 
conclusions. Evidence from human studies using different 
endpoints (symptoms, brain imaging studies) confirming 
the rodent findings, or identifying a definitive pattern of 
dysbiosis in patients with IBS are limited.'® 
Conflicting evidence exists regarding alterations 
to the organization and function of the gut micro- 
biota in patients with chronic abdominal pain and 
in adult and paediatric patients with IBS.*!”” Several 
studies examining faecal samples from patients with 
IBS reported decreased proportions of the genera 
Bifidobacterium and Lactobacillus, and increased ratios 
of Firmicutes:Bacteroidetes at the phylum level, even 
though a causal role of these microbial changes in clini- 
cal symptoms has not been established. On the other 
hand, one might speculate that some of these changes 
might be related to alterations in regional gut transit 
and secretion secondary to altered ANS output. A study 
in a cohort of well-phenotyped patients with IBS high- 
lights the complexities of brain—gut-microbiota altera- 
tions in IBS.*' IBS subgroups were identified based on 
hierarchical clustering of operational taxonomic unit 
information from 16S ribosomal RNA analyses.*! Two 
IBS clusters were clearly separated from each other, from 
a ‘normal-like’ IBS sample and from the healthy control 
sample. Although the normal-like IBS sample showed 
normal diversity and normal Firmicutes:Bacteroidetes 
ratio (as in, similar to the pattern in the healthy control 
sample), another cluster showed diminished, and a third 
cluster showed increased diversity. Both of these clusters 


showed increased Firmicutes:Bacteroidetes ratios. Only 
the normal-like IBS group showed a markedly elevated 
rate of depression symptoms (40%). In addition, pro- 
longed colonic transit times, common in patients with 
depression correlated with the prevalence of 17 taxa.*? 
In a randomized, placebo-controlled study of healthy 
men and women, psychological distress and anxiety 
improved after taking a Lactobacillus-containing and 
Bifidobacterium-containing probiotic compared to those 
taking a matched control product, although another study 
using a different Lactobacillius probiotic failed to confirm 
these findings.'°°' 

Findings from brain imaging studies in healthy indi- 
viduals and patients with IBS has provided some evi- 
dence supporting reported rodent gut microbiota—brain 
interactions. One study has shown that chronic ingestion 
of a probiotic consortium (Bifididobacterium animalis 
ssp. lactis, Lactococcus lactis spp. lactis, Lactobacillus 
delbrueckii spp. bulgaricus and Streptococcus thermo- 
philus) for 4 weeks altered functional brain responses to 
an emotional face recognition task in healthy women.” 
Compared with two control groups (one group received a 
nonfermented milk product and the other no treatment), 
the women who had ingested the probiotic had a reduced 
response to the task across a wide network of brain 
regions that included sensory and emotional regions. 
Although no treatment-related changes in self-report of 
symptoms of anxiety or depression were seen, the find- 
ings suggest a basic change in responsiveness to negative 
emotional stimuli in the environment. No organizational 
changes in the gut microbiota were observed in this and 
a previous nonimaging study using the same interven- 
tion.'® The effects of the probiotic intervention on brain 
responsiveness were probably mediated by a change in 
microbial-derived metabolites; this hypothesis will have 
to be confirmed in future studies. Preliminary evidence 
investigating correlations between gut microbial metabo- 
lites and brain structure in healthy individuals and 
patients with IBS demonstrated statistically significant 
correlations between several metabolites and structural 
aspects of several brain regions.!°° 


An integrated model 

Hypothesis 

On the evidence reviewed here, the following model is 
proposed that lends itself to experimental evaluation. 
The hypersensitive brain, and presumably ENS, shows 
increased responses to a variety of viscerosensory and 
exterosensory stimuli, which by themselves might not be 
consciously perceived in healthy individuals or in patients 
with inflammatory bowel disorders in whom intact 
descending inhibitory bulbospinal influences reduce 
dorsal horn excitability.*”'** This hyper-responsiveness 
might be a primary genetic or epigenetic alteration in 
certain brain networks (autonomic, emotional arousal or 
salience), or might be secondary to chronic experience 
of increased sensory input from the gut. Such increased 
viscerosensory signalling could originate from any of the 
elements of the gut connectome. Altered brain networks 
generate altered signals, which are transmitted to the 
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secretion), the gut microbiota and the external environ- 
ment (Figures 1 and 5). The nonlinearity of the model is 
reflected by the complexity of these macroscopic com- 
ponents and their macroscopic interactions, which form 
circular regulatory loops. Applied to the microscopic 
level, the model proposes the interactions between highly 
diverse cells types making up the various macroscopic 
components (Figure 5). 

Given the complexity of the expanded gut-microbiota- 
brain-environment axis, and with the rapid advance of ana- 
lytical tools, in particular of ‘-omics’ technologies—in the 
case of IBS studies, primarily genomics, transcriptomics, 
proteomics, metabolomics, brain connectomics and 
their integration into, for instance, gene and protein 
networks—we believe that systems biological approaches 
are essential to fully understand symptom generation and 
to identify novel treatment approaches in the future. 

The multiorgan, systemic view of IBS leads to the pre- 
diction that disease models focusing on individual cellu- 
lar components at the macroscopic level (brain, immune 
system, gut, microbiota, environment) will have limited 
validity. Furthermore, this view predicts that therapies 
targeted at single organs (for example, brain, immune 
system, gut), single mechanisms (for example, secretion, 
motility, gut microbiota, pain, diet), or single molecular 
targets (such as ion channels or receptors) are not likely to 
be successful in treating the entire syndrome, as evidenced 
by the limited success in drug development for IBS to date, 
reflected in rather small effect sizes on overall changes in 
IBS symptoms. This problem is compounded by the fact 
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Figure 5 | Systems biological model of IBS. Schematic illustrating a systems 
biological view of components involved in the development of IBS at the cellular 
and molecular level. Systems-based interactions between central and peripheral 
components of the brain—gut axis can be studied at the level of the genome, 
epigenome, transcriptome, proteome, metabolome and brain connectome. 


periphery through the ANS, HPA axis and descending 
modulation to the dorsal horn. The chronically increased 
ANS output, results in an extensive remodelling of various 
peripheral cells in the immune system, ENS, gut epithe- 
lium (permeability), and in the composition and function 
of the gut microbiota, all contributing to sensitization of 
visceral afferent pathways, and increased viscerosensory 
feedback to the brain, reinforcing the circular regulatory 
loops. Memory formations at the level of the immune 
system, the nervous system and the gut microbiota early in 
life are likely to contribute to the chronicity of symptoms. 


Systems biology view 

The evidence reviewed above strongly suggests that IBS is 
a systems disease (Figure 5), involving not only complex 
individual systems—nervous, immune, digestive, micro- 
biota and the environment—but also their complex, non- 
linear, reciprocal interactions. Thus, we propose a systems 
biological view of IBS pathophysiology, an approach taken 
in other areas of biology and disease'®'® involving a large 
number of functionally diverse, interacting components, 
each of which contributes only a small fraction to the vari- 
ance of the symptoms. These components interact highly 
selectively at multiple scales and typically in a nonlinear 
fashion to produce coherent behaviours and outcomes.'®” 
Applied on a macroscopic level, we propose this model 
to characterize the interactions of the nervous system, 
immune system, the gut environment (motility and 


that different subsets of patients can be characterized by 
different patterns of interactions between the components 
of the overall system, resulting in differential responses to 
a particular therapy. 

Indeed, each isolated mechanism proposed over the 
past 30 years only explains a small fraction of the vari- 
ance of the clinical phenotype” and no isolated therapies 
have had better effectiveness than ~10% over placebo.'® 
In addition, the systems view predicts that a single agent 
targeting multiple components of the system (tricyclic 
antidepressants, 5-HT, antagonists), or combined thera- 
pies, aimed at simultaneously targeting multiple organs 
and mechanisms (combination of laxatives, antidiarrhoeal 
agents, probiotics and centrally acting drugs) are likely to 
be more effective in the clinic, than individual treatments 
by themselves. Although no controlled studies have been 
conducted yet and are needed, in our opinion, this sug- 
gested integrated approach is in part supported by current 
clinical practices and the consensus of most physicians. 

Although still in their infancy, integrated multiomic 
approaches could be essential in the future to better under- 
stand the disease spectrum for IBS at the system level and 
the underlying mechanisms. Ideally, one would like to 
combine high-throughput microscopy and brain imaging 
to obtain brain connectome variables, and sequencing 
and mass spectrometry methods to obtain complete 
measurements of genomic, epigenomic, transcriptomic, 
proteomic and metabolomic variables simultaneously 
in multiple organs and tissues (in particular in gut and 
brain), including the microbiota. Data derived from these 
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—omics approaches can be integratively analysed in com- 
bination with other sources of information ranging from 
public databases to the literature to identify, for instance, 
targetable hubs. This integrated -omics approach faces 
major obstacles stemming from the obvious difficulties 
of performing some of these analyses in human patients 
(for example, nervous system tissue). However, impor- 
tant steps can be taken in this direction both using rodent 
models!°*!!!” and by performing partial measurements 
in humans. 

Some of these analyses should be performed at multiple 
time points along the circadian cycle. Circadian rhythms 
are found at the molecular level in all tissues, both centrally 
and peripherally, and have a fundamental role in coordin- 
ating the physiology and homeostasis of the organism. 
Indeed integrative system biology analyses” have revealed 
that about 10% of all transcripts and metabolites oscillate 
in a circadian manner in any tissue,'” with little overlap 
across different tissues beyond the core molecular clock 
comprising a transcriptional-translational negative feed- 
back loop coordinated by a dozen genes.'”* Furthermore, 
the list of oscillating molecular species in a given tissue is 
altered by genetic, epigenetic and environmental pertur- 
bations and thus provides a characteristic physiological 
signature of a tissue and its health state.'!’*'”° Furthermore, 
complex reciprocal interactions exist between the mol- 
ecular rhythms found in different brain regions and in 
other organs. Thus, in short, it is reasonable to predict 
that IBS should result in perturbed lists of transcripts 
and metabolites that oscillate in a circadian manner both 
centrally in specific brain areas as well as peripherally, for 
instance in the gut. 


Conclusions 

Clearly, none of the growing list of individual abnormali- 
ties identified in patients with IBS by itself can account 
for the variance of the clinical phenotype. For the same 
reason, such individual abnormalities are unlikely to rep- 
resent reliable biomarkers and are not likely to represent 
suitable targets for the development of highly effective 
treatments. Rather, as depicted schematically in Figures 1 
and 5, the clinical phenotype emerges from the inter- 
actions of multiple systems in the periphery (gut con- 
nectome, microbiome, genome and epigenome) and in 
the brain (connectome) interacting with each other 
in bidirectional ways. Most consistent with a systems 
view is the concept that central and peripheral abnor- 
malities form circular loops that reinforce each other. 
In the absence of comprehensive phenotyping studies 
in patients with IBS performed longitudinally with or 
without therapeutic interventions, and without targeted 
mechanistic animal studies, it remains unclear which of 
these reported abnormalities are primary and which are 
secondary. On the basis of this assessment, we suggest 
that in the future, high-throughput -omics measurement 
across both tissues and time, combined with comprehen- 
sive characterization of clinical, behavioural and brain 
endophenotypes, should enable more accurate differen- 
tial analyses, uncover complex system-level interactions 
and, ultimately, help develop more efficient, multi- 
pronged therapies.'”” Such a system biological approach 
might not only hold promise when applied to IBS and 
related functional gastrointestinal disorders, but also 
for inflammatory diseases of the gut, such as IBD and 
coeliac disease.'” 
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PRIMER | 


Irritable bowel Syndrome 


and Robin C. Spiller" 


summarized in this Primer. 


Irritable bowel syndrome (IBS) is a functional bowel 
disorder (that is, not associated with structural or bio- 
chemical abnormalities that are detectable with the cur- 
rent routine diagnostic tools) characterized by abdominal 
pain or discomfort, stool irregularities and bloating 
(BOX 1). Symptoms can be debilitating in many individ- 
uals, but may be mild or moderate in other patients. 
In addition, IBS is often associated with other somatic 
comorbidities (for example, pain syndromes, overactive 
bladder and migraine), psychiatric conditions (includ- 
ing depression and anxiety) and visceral sensitivity. The 
population prevalence of IBS is high (~11%) and the 
condition has considerable consequences for quality of 
life (QOL) that are comparable to other chronic diseases, 
such as diabetes mellitus and hepatitis. IBS is diagnosed 
based on symptoms, and a distinction is made between 
the following subtypes of IBS: IBS with pain or discom- 
fort and predominant constipation (IBS-C), IBS with 
diarrhoea (IBS-D), mixed IBS (IBS-M) and unsubtyped 
IBS (IBS-U) (Fic. 1). Moreover, other diseases (including 
other functional gastrointestinal diseases, such as func- 
tional dyspepsia and gastroesophageal reflux disease) that 
may cause the typical IBS symptoms should be excluded. 
Although a substantial proportion of patients will experi- 
ence spontaneous remission over time, there is currently 
no treatment that cures IBS; relief of symptoms is the 
most that can be achieved. 


Paul Enck', Qasim Aziz?, Giovanni Barbara’, Adam D. Farmer’, Shin Fukudo%, 
Emeran A. Mayer’, Beate Niesler®, Eamonn M. M. Quigley’, Mirjana Rajilic-Stojanovicé, 
Michael Schemann’, Juliane Schwille-Kiuntke', Magnus Simren'?, Stephan Zipfel' 


Abstract | Irritable bowel syndrome (IBS) is a functional gastrointestinal disease with a high population 
prevalence. The disorder can be debilitating in some patients, whereas others may have mild or 
moderate symptoms. The most important single risk factors are female sex, younger age and 
preceding gastrointestinal infections. Clinical symptoms of IBS include abdominal pain or discomfort, 
stool irregularities and bloating, as well as other somatic, visceral and psychiatric comorbidities. 
Currently, the diagnosis of IBS is based on symptoms and the exclusion of other organic diseases, 

and therapy includes drug treatment of the predominant symptoms, nutrition and psychotherapy. 
Although the underlying pathogenesis is far from understood, aetiological factors include increased 
epithelial hyperpermeability, dysbiosis, inflammation, visceral hypersensitivity, epigenetics and 
genetics, and altered brain—gut interactions. IBS considerably affects quality of life and imposes a 
profound burden on patients, physicians and the health-care system. The past decade has seen 
remarkable progress in our understanding of functional bowel disorders such as IBS that will be 


IBS is a multifactorial disease. Hence, the under- 
lying pathogenesis is considered complex and the pre- 
cise molecular pathophysiology is far from understood. 
Several functional alterations have been described, such 
as altered visceral sensitivity, functional brain alterations, 
bowel motility and secretory dysfunctions, and somatic 
and psychiatric comorbidities. Furthermore, gastroin- 
testinal abnormalities — such as immune activation, 
gut dysbiosis (microbial imbalance), impaired mucosal 
functions, nerve sensitization, post-infectious plasticity, 
altered expression and release of mucosal and immune 
mediators, and altered gene expression profiles — have 
been associated with IBS. However, a coherent link 
between particular pathologies and IBS symptoms is 
yet to be established. 

Moreover, results from studies assessing the contrib- 
ution of most of the proposed pathological factors are 
inconsistent and the particular aetiology is often not 
related to particular gut symptoms. For example, some 
studies have found evidence for gut micro-inflammation 
in IBS, whereas others could not confirm this finding, 
despite similar gastrointestinal symptoms. Such dis- 
crepancies, which also apply to the other biomarker 
candidates (not only to inflammation), strongly sug- 
gest the existence of IBS subpopulations, which, despite 
the similarity in gut symptoms, can be defined and 
distinguished by their pathophysiology and in-depth 
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assessments of clinical and molecular biomarker clus- 
ters. The same heterogeneity is evident with respect to 
clinical diagnosis and management. Indeed, medical 
treatment, nutritional intervention and psychotherapy 
lack consistent and homogeneous efficacy, but can be 
effective in some subgroups. 

This Primer summarizes recent progress in our 
understanding of IBS prevalence, comorbidities, 
QOL and the putative roles of inflammation, genet- 
ics, the intestinal microbiota and the brain-gut axis 
in IBS pathogenesis. Furthermore, we will discuss the 
current diagnostic approach and highlight the thera- 
peutic options in IBS, including drugs, nutrition 
and psychotherapy. 


Epidemiology 

Global prevalence and incidence 

Prevalence rates of IBS vary between 1.1% and 45%, 
based on population studies from countries world- 
wide (FIG. 2; Supplementary information S1 (table)), 
with a pooled global prevalence of 11.2% (95% CTI: 
9.8-12.8)'. Prevalence rates of 5-10% are reported for 


Box 1| IBS definition and subtypes: Rome Ill criteria 


Diagnostic criteria* for irritable bowel syndrome (IBS) include recurrent abdominal 
pain or discomfort at least 3 days per month in the past 3 months associated with two 
or more of the following: 

e Improvement with defaecation 

e Onset associated with a change in the frequency of stool 

e Onset associated with a change in the form (appearance) of stool 

*Criteria fulfilled for the past 3 months with symptom onset at least 6 months before diagnosis. 
*Discomfort means an uncomfortable sensation not described as pain. In pathophysiological 
research and clinical trials, a pain or discomfort frequency of at least 2 days per week during 


screening evaluation for subject eligibility. Adapted with permission from REF. 119, American 
Gastroenterology Association. 


most European countries, the United States and China’. 
Population statistics for IBS in most African and many 
Asian countries are unavailable, which might point to 
the inability to differentiate between infectious diarrhoea 
and IBS in tropical countries, especially in those nations 
with poor health-care systems or limited patient access 
to medical care, or to less attention of the health-care 
system for functional disorders, once an acute infection 
has been excluded’. 

Gathering subtype-specific prevalence information is 
complex. IBS subtypes overlap considerably in terms of 
symptoms, and patients vary over time in terms of their 
predominant symptoms, and thus switch subtype’*. The 
few population studies that have differentiated between 
IBS subtypes suggest that, in countries with a total IBS 
prevalence of ~10%, IBS-C and IBS-D each account for 
one-third of the affected population*. Incidence rates of 
IBS (that is, the annual occurrence of new cases) are not 
reported for most countries, but a few long-term sur- 
veys (210 years) in the United States allow for an esti- 
mation of the annual incidence in the range of 1-2%”. 
At the same time, disappearance rates of 2% have been 
reported’®, indicating spontaneous disease remission. 


Association between IBS and other disorders 

Not only do IBS subtypes overlap® but population-based 
studies also report a substantial overlap of =20% with 
other functional gastrointestinal disorders of the upper 
and lower gastrointestinal system: functional dyspepsia, 
heartburn, gastroesophageal reflux disease and nausea 
on the one hand’, and diarrhoea, incontinence, pelvic 
floor dyssynergia and constipation on the other hand’. 
An overlap of IBS with inflammatory bowel diseases 
(IBDs; including Crohn disease and ulcerative colitis) 
during remission phases has been proposed’ but is not 
mutually agreed on"®. 

Other IBS-associated disorders (FIG. 3) include func- 
tional non-gastrointestinal syndromes, such as urological 
chronic pelvic pain syndrome (this term includes inter- 
stitial cystitis and chronic prostatitis), vulvodynia, over- 
active bladder, prostatic pain syndrome, premenstrual 
syndrome, sexual (including erectile) dysfunction, 
chronic pelvic pain, fibromyalgia syndrome, chronic 
fatigue syndrome, migraine, eating disorders, nutri- 
tional intolerances and others". All of these syndromes 
considerably overlap with IBS in population studies to 
a degree that is often beyond what is expected based on 
the prevalence rates of the individual diseases. Given that 
many of these conditions are only diagnosed in special- 
ized centres, it has been questioned as to whether some 
of these conditions — for example, IBS and chronic pelvic 
pain — are one and the same disease’. 

In addition, most epidemiological studies note the 
presence of psychiatric comorbidities (such as anxiety, 
depression, somatization or neuroticism) not only for 
IBS but also for these IBS-associated diseases. Again, the 
rates are above the expected levels for IBS and the popu- 
lation prevalence of these symptoms’. Thus, the entire 
disease entity (IBS, functional gastrointestinal disorders 
and other functional non-gastrointestinal disorders) has 
been included in the term ‘somatic symptom disorder’ 
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Figure 1| IBS subtypes according to the Rome Ill criteria. A two-dimensional graph of 
the four possible irritable bowel syndrome (IBS) subtypes according to bowel form at a 
particular point in time, and the percentage of time this bowel form has to be present to 
meet the criteria for IBS with constipation (IBS-C), IBS with diarrhoea (IBS-D), mixed-type 
IBS (IBS-M) and unsubtyped IBS (IBS-U). Adapted with permission from REF. 119, 
American Gastroenterology Association. 


in the Diagnostic and Statistical Manual of Mental 
Disorders 5th Edition (DSM-5)"* and in psychiatric or 
psychosomatic clinical management”. Patients with 
IBS who were treated by psychiatrists frequently did 
not receive adequate attention with respect to their 
gastrointestinal symptoms before the release of DSM-5. 


Risk factors for IBS 

The best-documented risk factor for IBS is female sex, 
which is associated with an odds ratio of 1.67 (95% CI: 
1.53-1.82) across many population-based studies"®, 
with explanations varying between sex-different health 
care, consultation behaviour and biological functions 
(for example, hormonal regulation of gut functions). 
The incidence of IBS decreases with advancing age 
(>50 years)’, but is similar in children and adolescents 
compared with adults and does not necessarily trans- 
mit from childhood to adulthood’. However, family 
aggregation has been reported"* that is driven by genet- 
ics’? as well as by social learning”. BOX 2 lists the per- 
sonal, disease, psychosocial and social factors that have 
been found to be associated with increased risk of IBS, 
although some of these factors have only been identi- 
fied in individual studies”! or have been found to vary 
between countries and settings. 


Post-infectious IBS 

Several studies have shown an association between IBS 
and preceding gastrointestinal infections of bacterial, 
viral or other origin”. The pooled odds ratio is 7.3 
(95% CI: 4.7-11.1) for the development of IBS after 
infectious gastroenteritis”, with a median prevalence 
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of ~10%”. This association seems to differ with respect 
to epidemic infectious events that affect many people at 
the same time and individual infections, such as travel- 
lers’ diarrhoea. That is, prevalence data are reported to be 
higher (15-30%) in epidemic events” and lower (5-10%) 
following travellers’ diarrhoea”’; these differences are 
presumably due to different reporting biases in these 
populations. Thus, a median prevalence of 10% might 
better reflect the true prevalence of post-infectious IBS 
than the extreme values reported in individual studies. 
Risk factors for the development of post-infectious IBS 
are female sex, younger age, the severity of the initial 
infection and premorbid psychological conditions”. 
Based on symptoms alone, post-infectious IBS cannot 
be distinguished from IBS without an infectious origin, 
but inflammatory biomarkers may. The most valid dis- 
tinction may be a sudden onset that is well remembered 
by the patient and is associated with fever, bloody stools 
and a positive laboratory stool test for an infective agent. 


Mechanisms/pathophysiology 

Although the aetiology of IBS remains largely undeter- 
mined, our understanding of the potential mechanisms 
involved in gut dysfunction, visceral sensation and 
symptom generation is rapidly advancing. Growing 
evidence suggests that, in IBS, the epithelial barrier, gut 
microbiota, food antigens and bile acids elicit abnormal 
responses in the key regulators of sensorimotor func- 
tions, including the hypothalamus-pituitary—adrenal 
(HPA) axis, the immune system, the brain—gut axis and 
the enteric nervous system (ENS) (FIG. 4). Accordingly, 
these factors might have a role as potential biomark- 
ers of disease (BOX 3). In addition to these putative bio- 
markers, psychological factors (‘psychomarkers’) such 
as depression and anxiety, which are known to respond 
to abdominal symptoms (bottom-up), and psychosocial 
factors (‘stress’) that influence physiological (intestinal) 
functions, such as motility and visceral sensitivity (top- 
down), have been acknowledged and will be discussed 
in more detail. 


The epithelial barrier 

The epithelial gut lining represents an enormous sur- 
face that is in constant contact with the environment 
and with billions of bacteria that constantly challenge 
the intestinal immune system. Increased intestinal per- 
meability is considered an early event in IBS that leads to 
low-grade immune cell infiltration of the gut mucosa”. 
Indeed, increased epithelial permeability has been pri- 
marily described in post-infectious IBS in general and 
in IBS-D in particular, although some reports have also 
shown that IBS-C and IBS-M might also involve an 
increase in epithelial permeability”. Evidence for the 
presence of this remodelling in IBS has been provided by 
electron microscopy, which has detected enlarged spaces 
between epithelial cells and cytoskeletal condensation 
in gut biopsies of patients with IBS-D”*°. In addition, 
Ussing chamber experiments, which measure epithe- 
lial membrane properties on colonic mucosal biop- 
sies, have shown excessive passage of macromolecules 
from the luminal to the basolateral side of gut tissue 
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Figure 2 | IBS prevalence in population studies around the world. Pooled prevalence data per country are colour- 
coded. Data from REF. | are supplemented by studies from another nine countries (see Supplementary information S1 


(table)). IBS, irritable bowel syndrome; N/A, not applicable. 


in biopsies obtained from patients with IBS compared 
with asymptomatic controls, hence providing the func- 
tional correlate for the described structural epithelial 
barrier defects”’. 

Morphological and functional changes in intestinal 
permeability are related to abnormal gene and protein 
expression of tight junction proteins, including a reduc- 
tion in the expression of occludin and zonula occludens 
protein 1 (REFS 25,28). These findings have recently been 
corroborated by genetic and epigenetic findings in tight 
junction proteins claudin 1, claudin 2 and cingulin, as 
outlined below. Tight junction changes are probably 
the result of both bacterial-mediated and proteasome- 
mediated degradation triggered by low-grade inflamma- 
tion”. Accordingly, inflammatory mediators including 
eicosanoids, histamine and proteases increase intestinal 
permeability. This may involve the participation of ENS 
neurons, which may amplify these effects”””®. 

Increased intestinal permeability has been linked to 
diarrhoea and pain severity”, suggesting that this mech- 
anism might have a role in symptom generation in IBS. 
Although the exact causes underlying the ‘leaky’ gut barrier 
in IBS remain elusive, it has been postulated that numer- 
ous factors could be involved, including genetics, epi- 
genetics, dysbiosis and food allergies”. Confocal laser 
endomicroscopy of the duodenal mucosa of patients with 
IBS after challenge with food to which the patients reported 
intolerance showed epithelial breaks and increased inter- 
villous spaces, indicative of increased intestinal perme- 
ability. These studies suggest a causative effect of food 
in the increased epithelial permeability in IBS*’. 


Bile acids 

A subset of patients with features compatible with IBS-D 
present with increased levels of total faecal bile acids 
caused by increased excretion and synthesis of serum C4 
(7a-hydroxy-4-cholesten-3-one; a surrogate for bile 
acid synthesis), which in turn influences bowel habit 
by accelerating colonic transit and inducing diarrhoea 
and visceral hypersensitivity in IBS***. Of note, genes 
involved in bile acid metabolism and function have been 
reported to be associated with colonic transit in IBS-D, as 
outlined below. 


Immune response 
It has been argued that the immune system participates 
in the pathophysiology of IBS based on the clinical obser- 
vation that infectious gastroenteritis is a strong risk factor 
for the development of IBS™. Additional clinical support 
comes from the evidence that about one-third of patients 
with IBD in remission experience IBS-like symptoms”. 
These inferential data have been subsequently enriched 
by quantitative immunohistochemistry data showing 
increased infiltration of T cells and mast cells in the mucosa 
of the small and large intestine of some patients with IBS”. 
Two randomized controlled trials (RCTs)*”** in patients 
with IBS demonstrated that the anti-inflammatory agent 
mesalazine was not superior to placebo in alleviating IBS 
symptoms, although both studies clearly indicated that 
subgroups, particularly patients with post-infectious IBS, 
had sustained symptomatic responses. Thus, these stud- 
ies confirm the hypothesis that immune activation has a 
considerable role in some patients with IBS. 


4| 2016 | VOLUME 2 


www.nature.com/nrdp 


© 2016 Macmillan Publishers Limited. All rights reserved. 


Somatic symptom disorder 


‘ “ cynronic pelvic pain Ch, - 


Sve mer 
— Ydrome Pain syeO" 


Somatization 


O Psychiatric disorders O Functional gastrointestinal disorders 


© Functional non-gastrointestinal disorders © IBS 


Figure 3 | IBS-associated comorbidities. A model of irritable bowel syndrome (IBS) and 
its associations with other clinical, intestinal, extra-intestinal and psychiatric conditions. 
For each of the listed disorders, overlap with IBS symptoms has been reported in the 
literature'’. The different components should be viewed as layers of complexity: the IBS 
subtypes are part of the group of functional bowel disorders, these are part of all kinds of 
functional disorders and these again are part of a ‘layer’ of psychiatric disorders. GERD, 
gastroesophageal reflux disease; IBS-C, IBS with constipation; IBS-D, IBS with diarrhoea; 
IBS-M, mixed-type IBS; IBS-U, unsubtyped IBS; PMS, premenstrual syndrome. 


Although mucosal immunocyte numbers are not 
always increased in IBS, there is strong functional and 
molecular evidence of an increased state of activation of 
immune cells in about half of patients with IBS**. Data 
from several studies point to the importance of mast cells 
as key components of inducing and maintaining low- 
grade immune activation in IBS”*. For instance, higher 
proportions of mast cells were found in a degranulating 
state in colonic biopsies from patients with IBS than in 
control samples, suggesting that increased activation 
of mast cells is involved in the condition”. In addition, 
biopsy supernatants from patients with IBS contained 
higher amounts of mast cell mediators, including pro- 
teases and histamine” as well as polyunsaturated fatty 
acid metabolites*, than controls. Mucosal immune 
activation is coupled with altered gene expression 
of several components of the host mucosal immune 
response to microbial pathogens (see below), suggesting 
that the microbiota might contribute to the observed 
immune activation”. 
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Neuroimmune interactions 

Mucosal mediators isolated from biopsy samples from 
patients with IBS have been extensively studied to 
identify their effect on bowel physiology and sensory 
perception in isolated tissues or laboratory animals”. 
Compared with controls, mucosal mediators from 
patients with IBS evoked higher activation of visceral 
and somatic pain pathways when applied to intestinal 
preparations isolated from rodents”. Mast cells and 
enteroendocrine cells have been suggested to partici- 
pate in this abnormal neural signalling, as indicated 
by the activation of human ENS neurons via mast cell- 
derived histamine, enteroendocrine cell-derived sero- 
tonin (also known as 5-hydroxytryptamine (5-HT)) 
and protease-dependent mechanisms?*”*” (FIG. 5). 
Although most of the proteases are secreted by mast 
cells, some of the serine and cysteine proteases that 
are present at a higher level in the mucosa or stool of 
patients with IBS than controls might be of other, prob- 
ably pancreatic or bacterial, origin. In line with these 
findings, serine proteases in faecal supernatants from 
individuals with IBS-D evoked colonic hypersensitiv- 
ity to distension™. By contrast, faecal cysteine protease 
activity was augmented in some patients with IBS-C 
compared with controls and increased protease activity 
correlated with abdominal pain and impaired epithe- 
lial permeability*. Further work showed the implica- 
tion of serine proteases that act on protease-activated 
nociceptors located on intestinal nerves conveying 
pain stimuli to the brain**. Importantly, mucosal 
mediators from patients with IBS and visceral hyper- 
sensitivity — but not from normosensitive patients 
with IBS — acutely activated spinal nociceptors when 
given to animal models*. In the same model, chronic 
exposure to soluble mediators from patients with 
IBS-D was shown to sensitize nociceptive neurons”, 
implying that chronicity is associated with long-lasting 
plasticity alterations. 

Attention has been directed to agonists of the tran- 
sient receptor potential cation channels (TRPs), which 
have been implicated in the pathogenesis of sensory 
hyperalgesia. Colon tissue samples from patients with 
IBS have increased levels of specific polyunsaturated 
fatty acids, which stimulate sensory neurons from 
mice via the activation of TRP subfamily V member 4 
(TRPV4) and generate visceral hypersensitivity”. The 
importance of those visceral afferents that express TRPs 
in IBS symptomatology is underscored by the finding 
that peripheral blood mononuclear cell (PBMC) super- 
natants from patients with IBS-D cause mechanical 
hypersensitivity of visceral afferents via tumour necro- 
sis factor (TNF) and TRPA]; this was not observed if 
control supernatants were used*. 

Recent data support the concept that the chronic 
release of factors with known effects on nerves in the 
intestinal milieu might not only have functional effects 
but could also affect the ENS and sensory fibres in a 
structural manner. For example, immunohistochemistry 
showed a 57.7% and 56.1% increase in mucosal neu- 
rons and neuronal outgrowth, respectively, in patients 
with IBS compared with healthy controls”. Indeed, 
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the intestinal mucosa of patients with IBS contains 
increased levels of nerve growth factor (NGF), primar- 
ily in mucosal mast cells. Experimentally, the effect of 
NGF was demonstrated in primary cell cultures of the 
rat myenteric plexus and the neuroblastoma cell line 
SH-SY5Y, which showed an increase in neurite growth, 
and protein and mRNA expression of growth-associated 
protein 43 (GAP43; also known as neuromodulin) — a 
key neuronal growth protein — following exposure 
to supernatant obtained from mucosal biopsies of 
patients with IBS”. 


Box 2 | Risk factors for IBS 


Personal factors 

e Sex (female) 

e Age (>50 years) 

e Birth cohort* 

e Breast feeding (<6 months)* 

e Herbivore pet in childhood* 

e Birth weight (low)* 

e Body mass index (low)* 
Psychological factors 

e Illness behaviour 

e Low quality of life 

e Acute psychological stress 

e Stressful life events 

e Sexual or physical abuse history 
e Anxiety, depression or somatization 
e Intimate partner violence* 


e Addictive behaviour* 


Somatic issues 

e Gastrointestinal infection 

e Somatic symptoms (pains, for example, joint pain 
and migraine) 

e Endometriosis 

e Abdominal obesity 

e Diverticular disease (left side) 

e Antibiotic use 

e Abdominal surgery 

e Spicy food consumption* 

e Sleep problems* 


e Low exercise level* 


Social conditions 

e Socioeconomic status (childhood) 

e Family history of substance abuse 

e Family history of mental illness 

e Working conditions (insufficient autonomy)* 

e Shift work* 

e Marital status (never married)* 

e Number of family members (with more members 
increasing the risk)* 

e Childhood war exposure* 

Less well-established factors are marked (*) and are based on 


single studies (for example, REF. 21), whereas all others have 
been shown in more than one study. 


Microbiota 

The gastrointestinal microbiota is a diverse and numer- 
ous ecosystem that inhabits the entire gastrointestinal 
tract and has a systemic influence on our health. Owing 
to its enormous complexity and high interindividual 
variability, the microbiota is still in large part undefined 
regarding the scope of its contribution to human physio- 
logy and tolerable compositional variations under which 
normal functions are preserved”. The evidence for an 
involvement of altered gut microbiota composition in 
IBS pathophysiology has been accumulating (BOX 4), 
but the aetiological role remains uncertain. The most 
prominent markers of IBS are derived from uncultured 
bacteria. Two groups of uncultured Clostridiales are 
significantly depleted in IBS°'”’, and bacteria related 
to Ruminococcus torques (a species belonging to the 
Lachnospiraceae) are profoundly enriched in patients 
with IBS°!**>* and levels positively correlate with bowel 
symptoms”'»*»’. In addition, increased Firmicutes to 
Bacteroidetes ratios have been observed at the phylum 
level, at least in a subset of patients” (for a recent review 
see REF. 56). Given the provided evidence, the dysbiosis of 
microbiota in IBS has been acknowledged by the Rome 
Foundation Working Team” as a plausible contributing 
factor to the disorder. Experiments with animal mod- 
els have shown that colonization of germ-free animals 
with microbiota from patients with IBS can induce 
visceral hypersensitivity”*, impair intestinal perme- 
ability and alter gastrointestinal transit time’? — indi- 
cating the importance and the possible aetiological role 
of the microbiota in IBS. 

Although diet changes have an effect on the abun- 
dance of particular microbial groups, the microbiotic 
signature (in terms of present species) is very stable®. 
To observe a profound effect, the dietary change has 
to be dramatic (for example, vegans switching to high- 
fat and high-protein diets*'). Dietary interventions 
(such as low dietary content of fermentable oligo- 
saccharides, disaccharides, monosaccharides and poly- 
ols (FODMAPs; BOXx 5), or the addition of sweeteners 
(fructo-oligosaccharides) or fibre (psyllium)) can 
improve symptoms of some but not all patients with 
IBS. Future studies should evaluate the relevance of 
these microbial groups for IBS and could contribute to a 
better understanding of the role of the microbiota in the 
pathophysiology of IBS that is currently acknowledged 
for the following contexts. 


Fermentation of non-digestible foods. An important 
role of the microbiota is degradation of non-digestible 
dietary components”. It is generally accepted that 
fermentation of carbohydrates is desirable because 
of the beneficial effects of the main fermentation 
products — short-chain fatty acids (SCFAs) — including 
energy supply to gastrointestinal epithelial cells, a 
decrease in inflammation and improvement in gut 
barrier function®’. However, in patients with IBS, the 
presence of the resistant carbohydrates FODMAPs can 
provoke IBS symptoms™. This might be a result of over- 
production or underproduction of relevant metabolites 
owing to the disturbed microbiotic balance, for example, 
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Figure 4 | Overview of the pathophysiology of IBS. Although the aetiology of irritable 
bowel syndrome (IBS) has not yet been completely elucidated, various factors have a 
role, including composition of the gut microbiota, intestinal permeability, immune cell 
reactivity and sensitivity of the enteric nervous system, the brain—gut axis (spinal, vagal or 
pelvic pathways) or the brain. The figure highlights those mediators that are probably 
involved in IBS pathology. The plus symbols indicate whether a mediator activates or 
inhibits its target cell; those in parentheses denote actions established in animal models 
and those without parentheses are effects demonstrated in humans (human tissue). 
5-HT, 5-hydroxytryptamine (also known as serotonin); CGRP, calcitonin gene-related 
peptide; GDNF glial cell-derived neurotrophic factor; IL, interleukin; PAR2, 
proteinase-activated receptor 2; TNF, tumour necrosis factor. 


due to an increased abundance of gas-producing and 
decreased abundance of gas-utilizing microorganisms. 
The quantity and composition of SCFAs in the gut 
differs between patients with IBS and healthy con- 
trols, although the available data are not always in 
agreement. Moreover, the production of microbial 
SCFAs stimulates regulatory T cell differentiation and 
affects the balance between pro-inflammatory and anti- 
inflammatory mechanisms”, suggesting that inadequate 
levels of SCFAs could provoke low-grade intestinal 
inflammation as observed in patients with IBS*”. 
Finally, studies of microbiota show that the abundance 
of several SCFA-producing bacteria — including 
Roseburia, Blautia and Veillonella” — is significantly 
increased compared with the levels of these bacteria in 
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healthy controls, providing a potential mechanistic basis 
for the development of IBS symptoms. 

Other carbohydrate-utilizing gastrointestinal bac- 
teria — namely, Dorea spp. — show significant increases 
in abundance in patients with IBS”’; these are the main 
gas-producing bacteria in the human gastrointestinal 
tract”’. The overproduction of gas is associated with 
IBS” and this phenomenon could underlie flatulence 
and abdominal pain. The excessive production of gas 
can also cause faster colonic transit in patients with 
IBS-D, as the colons of these patients are more sensitive 
to increased intestinal volume than healthy controls”. 
Intestinal gases are efficiently removed by methano- 
genic archaea”, which seem to be depleted in patients 
with IBS°! and are negatively correlated with the pres- 
ence of loose stools’. However, a significant increase 
in the abundance of this microbial group is character- 
istic of patients with slow transit and constipation”, 
whereas the degree of the methanogenic activity could 
be correlated with the severity of constipation in those 
with IBS-C”*. 

Another potential pathway for microbiotic involve- 
ment in IBS is protein degradation. The luminal con- 
tents of patients with IBS contain increased levels 
of proteases*’, which could be due to the increased 
secretion of endogenous and microbial proteases in 
response to protein-rich nutrition (typical of western 
diets), but could also be due to insufficient endogenous 
protease degradation by the disturbed gastrointestinal 
microbial community”. Serine protease inhibitors are 
produced by many bacteria, including bifidobacteria’, 
and their activity could prevent the excessive proteo- 
lytic activity of intestinal content in IBS. The depletion 
of bifidobacteria has been noted in both faecal and 
mucosal samples of patients with IBS°'”’, suggesting 
an important role for this bacterial genera in IBS. The 
fermentation of proteins generates numerous health- 
compromising substances**. Among these, hydrogen 
sulfide is a relevant toxin that impairs epithelial metabo- 
lism*' and can be further converted to tetrathionate, 
which stimulates the growth of tetrathionate-utilizing 
pathogens from Gammaproteobacteria®*’. The abun- 
dance of several Gammaproteobacteria significantly 
correlates with bowel symptoms in patients with IBS°!”’, 
and also with the levels of the inflammatory markers 
interleukin 6 (IL-6) and IL-8 (REF. 51) that are typically 
increased in IBS™. 


Microbiota and 5-HT. 5-HT is an important metabolite 
that, among other functions, regulates gastrointestinal 
motility; disturbed levels of 5-HT seem to be relevant 
for IBS pathology**. As much as 90% of 5-HT is pro- 
duced in enteroendocrine cells present in the gastro- 
intestinal tract, and it has been recently shown that 
intestinal bacteria are needed for the stimulation of 
5-HT synthesis. Attempts to identify microorganisms 
that are capable of 5-HT synthesis have shown that, in 
contrast to Bacteroides spp. and altered Schaedler flora 
(a community of eight bacterial strains), only specific 
spore-forming commensal bacteria have this feature. 
The majority of these spore-forming bacteria belong 
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Box 3| Structural and functional biomarker candidates in IBS* 


Altered motility and stool behaviour 
e Altered colonic transit time 


e Impaired bile acid transport* 


Mucosal permeability 
e Reduced epithelial resistance® 


e Reduced expression of ZO15 


Immune imbalance 

e Increased numbers of intraepithelial CD3* lymphocytes? 

e Increased mucosal cell density and reactivity® 

e Increased nerve mast cell association in the lamia propria region’ 

e Increased levels of T,,2 cytokines in the blood! 

e TNFSF15 and TNF polymorphisms*"! 

e Increased levels of the pattern recognition receptors TLR2 and TLR45 
e Increased levels of anti-flagellin autoantibodies! 

e Increased levels of histamine and proteases in biopsy supernatants® 
e Increased production of IL-18 and TNF by PBMCs! 


e Increased levels of B-defensin 2 antimicrobial peptide* 


Neural plasticity 
e Increased nerve fibre density in the epithelium and lamina propria® 


e Mostly visceral hypersensitivity, but <40% of patients are normosensitive or 
hyposensitive 


e Mucosal biopsy supernatants activate the enteric nervous system independent of 
stool behaviour or visceral sensitivity’ 


e Mucosal biopsy supernatants activate sensory fibres and dorsal root ganglion neurons 
(mostly hypersensitive IBS) 


e PBMC supernatants evoke mechanical hypersensitivity involving cytokines and TRPA1 


Serotonin metabolism and signalling 

e Increased plasma levels of serotonin in IBS-D! 
e Increased enterochromaffin cell density® 

e Altered SERT expression and polymorphism$ 


e Serotonin receptor and transporter polymorphisms? 


Others 

e Increased levels of cysteine and serine proteases* 

e Increased levels of mucosal PARM15 

e Increased levels of BDNF and NGF 

e Increased levels of rectal PYY and somatostatin cell count® 

e Altered microbiota diversity and composition* 

BDNF, brain-derived neurotrophic factor; IBS, irritable bowel syndrome; IBS-D, IBS with 
diarrhoea; IL-1, interleukin 18; NGF, nerve growth factor; PARM1, prostate androgen-regulated 
mucin-like protein 1; PBMC, peripheral blood mononuclear cell; PPY, peptide YY; SERT, 
serotonin reuptake transporter; T,,2, T helper 2; TLR, Toll-like receptor; TNF, tumour necrosis 
factor; TNFSF15, TNF superfamily member 15; TRPA1, transient receptor potential cation 


channel subfamily A member 1; ZO1, zonula occludens 1. *Based on data available in REF. 244. 
#In stool. Intestinal biopsy. !In blood. 


to the Clostridales class within the Firmicutes phy- 
revealed 
an increase of the Firmicutes phylum members on the 
account of the Bacteroidetes members in IBS. Given that 
the Clostridiales class within the Firmicutes phylum 
are the most diverse and the most abundant group of 
the microbiota”, it is not clear if the observed feature 
of the IBS microbiota is associated with 5-HT-mediated 
pathophysiology, but this possible link should certainly 


lum. Two recent comprehensive studies*!*° 


be further investigated. 


Brain and behaviour 

IBS is narrowly defined by recurrent abdominal pain 
and discomfort associated with altered bowel habits 
in the absence of an organic origin and/or explanation 
of symptoms. However, given that IBS is nearly always 
associated with increased anxiety and patients often 
show comorbidities with other chronic pain and psychi- 
atric conditions, a more widespread dysregulation of the 
nervous and immune systems is probably implicated*. 

The brain, the gut and its microbiota and the immune 
system show reciprocal associations in health and dis- 
ease. On the one hand, the brain, via the autonomic 
nervous system and the HPA axis, can influence intes- 
tinal motility and fluid secretion®’, intestinal epithelial 
permeability”***’, immune function” and gut micro- 
bial composition”’, all of which have been reported to 
be dysregulated in IBS. On the other hand, several of 
these peripheral alterations can influence brain structure 
and function either developmentally or in response to 
acute perturbations, setting up circular regulatory loops 
between the gut and the brain”. 

In addition to its role in the bidirectional communi- 
cations with the gut, the brain plays an essential part in 
assessing the salience of received or expected intero- 
ceptive (sensory) information”, determining how much 
of this information is amplified or tuned down, to what 
degree it is modulated by affect** and how much of this 
interoceptive information from the gut is consciously 
perceived (visceral sensitivity). One of the best-studied 
behavioural aspects of IBS-related central processing 
of gut-related information involves a coping strategy 
referred to as catastrophizing, a term that refers to a 
bias towards prediction of a high likelihood of worst 
outcomes”. This measure strongly correlates with the 
severity of pain symptoms and is a primary treatment 
target in cognitive—behavioural therapy. 

Multimodal brain imaging has made it possible to 
identify differences in functional (evoked and resting 
state) and structural (grey matter and white matter 
tracts) aspects of specific brain networks that provide 
a neurobiological substrate for previously observed 
affective and cognitive features of IBS (reviewed in 
REFS 92,96) (FIG. 6). These networks include the salience, 
attention, sensorimotor and emotional arousal net- 
works. Profound sex-related differences in these 
networks have also been identified in both healthy 
individuals and patients with IBS (reviewed in REF. 96). 
Cross-sectional correlations of brain networks with sev- 
eral clinical and non-brain biological parameters show 
a relationship between some of these brain signatures 
with IBS symptom severity and duration, a history of 
early adverse life events’’, gut metabolite and microbial 
composition”, gene expression profiles in PBMCs” and 
gene polymorphisms”. On the basis of these neuro- 
biological findings, a comprehensive IBS pathophysio- 
logical model can be formulated (FIC. 6), which includes 
alterations in the appraisal of and selective attention to 
interoceptive signals (salience and attentional network), 
central sensory processing of interoceptive information 
(sensorimotor network) and engagement of emotional 
arousal associated with experience and expectation of 
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gut sensations. This disease model not only identifies 
neurobiological correlates of well-characterized clinical 
and behavioural features of IBS but also provides a plau- 
sible explanation for the common coexistence of IBS 
with other chronic pain conditions and with increased 
trait anxiety. 

Although these findings have identified disease- 
relevant brain alterations in patients with IBS, mech- 
anistic and longitudinal studies are required to 


PRIMER 


determine the causality between these factors. For 
example, are central sensorimotor alterations a conse- 
quence of increased signals from the gut, are they the 
consequence of dorsal horn sensitization by increased 
descending pain-facilitating signals or are they a genet- 
ically determined trait that predisposes individuals to 
IBS and might be present in asymptomatic relatives!°"? 
The correlation of gut microbial signatures and PBMC 
expression profiles with structural alterations in the 
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Figure 5 |Neuroimmune interactions in the gut. An intimate anatomical 
and functional association between enteric neurons, terminals from 
extrinsic nerves and cells of the enteric immune system is the basis for 
neuroimmune interactions in the gut wall. Functional signalling between 
nerves and immune cells mostly happens in the epithelial and submucosal 
layers where there is a high density of immune cells — in particular, 
T lymphocytes, mast cells and macrophages. The neuroimmune interactions 
are bidirectional. Enteric neurons, extrinsic nerves and glial cells respond 
to cytokines and mast cell mediators. Some patients with irritable bowel 
syndrome (IBS) have circulating autoantibodies against neuronal structures 
and antibodies that are generated as a response to antigen exposure from 
the lumen. Neurons can respond directly to antibodies through direct 
activation of channels or receptors. They also respond to antigens through 
pathways involving neuronal Toll-like receptor 3 (TLR3), TLR4 and TLR7. 
Direct signalling between microbiota and the host involves activation of 
neurons through polysaccharide A. These direct effects of luminal factors 
are very likely to be outnumbered by signalling between epithelial 
(in particular, enteroendocrine cells), immune and nerve cells. Neurons also 
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express receptors for adenosine and ATP; both molecules are released in 
the gut wall under inflammatory or stress conditions. Reciprocally, nerves 
release factors that affect epithelial or immune cells. The best-documented 
effect is the activation of mast cells through the release of calcitonin 
gene-related peptide (CGRP) from extrinsic visceral afferents or enteric 
neurons. Conversely, acetylcholine (ACh) inhibits the activation of 
macrophages. Neurogenic inflammation, which is sometimes observed in 
animal models, is probably caused by the release of CGRP and substance P 
from extrinsic fibres followed by permeabilization of blood vessels. 
In addition, adipocytes in the lamina propria nestle against nerve fibres, and 
release of their pro-inflammatory mediators modulates nerve activity. The 
plus and minus symbols indicate whether a mediator activates or inhibits its 
target cell; those in parentheses denote actions established in animal 
models and those without parentheses are effects demonstrated in humans 
(human tissue). 5-HT, 5-hydroxytryptamine (also known as serotonin); CCK, 
cholecystokinin; CRF, corticotropin-releasing factor; IL-10, interleukin 10; 
NGF, nerve growth factor; PUFA, polyunsaturated fatty acid; TRP transient 
receptor potential cation channel. 
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Box 4| Dysbiosis in IBS 


Microbiota species increased in IBS 
e Enterobacteriaceae 

e Veillonella 

e Streptococcus* 

e Dorea 

e Blautia 

e Roseburia 

e Ruminococcus 

e Methanobrevibacter* 


Microbiota species decreased in IBS 
e Bifidobacterium 

¢ Collinsella 

e Streptococcus* 

e Faecalibacterium 

e Christensenellaceae 

e Clostridiales 

e Uncultured 

e Methanobrevibacter® 


IBS, irritable bowel syndrome. *IBS with diarrhoea. *IBS with 
constipation. *Mixed-type IBS. 


sensorimotor network suggests a possible role of these 
peripheral factors in influencing the brain. Similarly, are 
the altered salience and attention network alterations a 
secondary response to the chronically increased per- 
ception of visceral signals or are they a primary abnor- 
mality that is responsible for the generation of aberrant 
endogenous pain modulation, as well as emotional and 
autonomic nervous system responses? Future studies 
will need to address the question of whether these brain 
signatures differ between subgroups of patients with IBS, 
such as male and female patients, patients with a history 
of early adversity, patients with different durations of 
symptoms and patients with post-infectious IBS. 


Genetic and epigenetic data 

The latest genetic and epigenetic findings support cur- 
rent models of IBS pathogenesis that suggest disturbed 
intestinal barrier function, immune response and neuro- 
nal signal transduction™' (FIG. 6). The data even point 
towards potential diagnostic biomarkers or therapeutic 
options (BOX 3). For example, silencing the microRNA-29 
(mir-29) family or amplifying mir-199a expression might 
have important therapeutic implications for selected 
patients with IBS and symptoms caused by increased 
intestinal permeability or hypersensitivity'’””’”. 


Genetic data. Genetic studies to date range from fam- 
ily and twin studies to candidate gene approaches 
and, more recently, genome-wide association studies 
(GWAS). Regardless of enlarged sample sizes, increased 
statistical power and meta-analyses, genetic variants 
associated with IBS are still scarce and/or have not 
been replicated in independent cohorts. A recent paper 
summarizes all currently available genetic data that have 
been replicated". 


Polymorphisms or variants in several genes have been 
found to be associated with IBS. Genes encoding proteins 
involved in homeostasis of epithelial barrier function, 
such as cadherin 1 (CDH1) and cell division cycle 42 
(CDC42), the immune system, such as IL6, IL10, TNF 
and TNF superfamily member 15 (TNFSF15; encod- 
ing cytokines and neuronal signal transduction) and 
others (such as neurexophilin 1 (NXPH1) and sodium 
voltage-gated channel a-subunit 5 (SCN5A)) have been 
replicated in several studies'”’. In 2014, a small pilot study 
reported an association between IBS and a locus on chro- 
mosome 10 (containing the protocadherin 15 (PCDH15) 
gene) in a discovery sample from Australia that could not 
be replicated in additional cohorts from Sweden and the 
United States’. Mutations in the following genes encod- 
ing proteins involved in the serotonergic system have also 
been shown to be associated with IBS: solute carrier fam- 
ily 6 member 4 (SLC6A4; also known as 5-HTTLPR or 
SERT), 5-HT receptor 3A (HTR3A), HTR3E and HTR4 
(REF. 101). A polymorphism in SLC6A4 has been found 
to be associated with altered brain responses, visualized 
through functional brain imaging following visceral pain 
stimuli in patients with IBS’. Furthermore, a functional 
polymorphism in HTR3A could be associated with 
altered amygdala responsiveness, anxiety and increased 
symptom score in IBS'”. These findings underline the 
effect of polymorphic serotonergic and other genes in 
modulating gut-derived brain response in areas that 
process visceral perception and integrate autonomic con- 
trol, salience and somatosensory and emotional central 
networks (FIG. 6). 

Variants of genes encoding proteins that are involved 
in bile acid synthesis regulation (the Klotho-6 (KLB) 
gene, the fibroblast growth factor receptor 4 (FGFR4) 
gene and the G protein-coupled bile acid receptor 1 
(GPBAR1) gene) are associated with accelerated colonic 
transit in patients with IBS-D'”!*. These variants also 
correlate with the colonic transit response to cheno- 
deoxycholic acid (a bile acid used to treat constipation) 
in IBS-C’” and to colesevelam (a bile acid sequestrant 
used to treat diarrhoea) in patients with IBS-D!!°""!. 

Finally, a locus at 7p22.1 in which the genes KDEL 
endoplasmic reticulum protein retention receptor 2 
(KDELR2) and GRID2-interacting protein (GRID2IP) 
localize was significantly associated with IBS risk in 
the index GWAS (a large twin discovery sample from 
Sweden) and all replication cohorts in Europe, the 
United States and Australia'’*. However, the underlying 
molecular cause for this association finding has not 
been elucidated. 


Epigenetic data. Even less insight into the role of epi- 
genetics in IBS pathology is available compared to 
the genetic implications. To date, only a few miRNA 
studies have been performed. These studies reported on 
the differential expression profiles of miR-29a, miR-29b, 
miR-103, miR-16, miR-125b and miR-199a in the intes- 
tinal mucosa of patients with IBS-D. Upregulation of 
miR-29a and miR-29b was reported to accompany 
downregulation of the target genes encoding glu- 
tamine synthetase (GLUL)'”, claudin 1 (CLDN1) and 
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NF-«B-repressing factor (NKRF); CLDN1 and NKRF 
correlated with increased gut permeability'”. In addi- 
tion, decreased expression of miR-103, miR-16 and 
miR-125b correlated with the upregulation of the tar- 
get genes encoding the tight junction proteins claudin 2 
(CLDN2) and cingulin (CGN)'”’. In turn, a diminished 
miR-199 level correlated with an upregulation of TRPV1 
and increased visceral sensitivity'*. Moreover, variants 
residing in miRNA target regions of the 5-HT receptor 
genes HTR3E and HTR4B — namely, c.*76G>A and 
c.*61T>C — were found to be associated with IBS-D. 
Both variants were reported to impair miRNA regula- 
tion and to lead to disturbed expression regulation of 
miR-510 and miR-16, respectively'!*"'®. One pilot study 
further indicated increased levels of circulating miR-150 
and miR-342-3p in the blood of patients with IBS". Of 
note, miR-150 has been described to be associated with 
IBD and pain, whereas miR-342-3p has been predicted 
to target genes that are relevant for pain signalling, 
colonic motility and smooth muscle function". 


Diagnosis, screening and prevention 

The diagnosis of IBS relies on the patient fulfilling 
diagnostic criteria for IBS'!’ in conjunction with nor- 
mal results on a limited number of additional tests and 
investigations used to rule out other diagnoses with 
reasonable certainty (FIG. 7). Although a substantial 
proportion of clinicians'”° prefer a process of thorough 
exclusion of other diseases, the current recommendation 
is to base diagnosis on symptoms’”. There is currently 
no valid biomarker for IBS!”’. The choice of the tests or 
investigations deemed necessary to rule out other con- 
ditions varies depending on the clinical situation and 
the symptom profile of the patient. In the majority of 
cases with a typical clinical history compatible with IBS, 


Box 5| FODMAPs and a low FODMAP diet* 


FODMAPs stands for fermentable oligosaccharides (fructans present in wheat, rye, 
onion and garlic chicory; and galactans present in legumes and beans), disaccharides 
(lactose present in milk and milk products), monosaccharides (fructose present in 
artificial sweeteners) and polyols (sugar alcohols present in apples, pears, stone fruit, 
cauliflower, mushrooms and sweeteners). A low FODMAP diet may include reasonable 
amounts of: 


e Vegetables: bamboo shoots, cucumber, carrot, corn, aubergine (eggplant), lettuce, 
leafy greens, pumpkin, potato, squash, yam, tomato and courgette (zucchini), 
among others 

e Fruits: banana, cantaloupe, grapes, grapefruit, kiwifruit, kumquat, lemon, lime, 
mandarin, orange, passion fruit, pawpaw, pineapple, rhubarb and tangerine, 
among others 

e Protein: beef, chicken, canned tuna, egg, egg whites, fish, lamb, pork, shellfish, turkey, 
cold cuts, nuts and seeds, among others 

e Dairy and non-dairy alternatives: lactose-free milk, cream cheese, hard cheeses 
(cheddar, parmesan and Swiss), mozzarella and sherbet (almond milk, rice milk 
and rice-milk ice-cream), among others 

e Grains: wheat-free grains or wheat-free flours and products made with these (for 
example, bagels, breads, crackers, noodles, pancakes, pastas, pretzels and waffles), 
corn flakes, cream of rice, grits, oats, quinoa and rice, among others 

e Beverages: water, coffee and tea, and low FODMATP fruit or vegetable juices, 
among others 


*See REFS 171,174. 
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only a limited number of laboratory tests are recom- 
mended without any need to perform invasive investi- 
gations. Screening for IBS risk and for prevention of 
IBS development is currently not applicable, given the 
heterogeneity of the disease and the multiplicity of 
putative pathophysiological mechanisms. 


Diagnostic criteria 

As individual symptoms have poor sensitivity and speci- 
ficity to diagnose IBS, diagnostic criteria incorporating a 
combination of symptoms have been developed, similar 
to the DSM system within psychiatry. The first attempt 
was the so-called Manning criteria, published in 1978 
(REF. 122). In this publication, several symptoms were 
shown to be more common in patients with IBS than in 
patients with another organic gastrointestinal disease. 
By combining these symptoms, IBS could be discrimin- 
ated from other organic gastrointestinal diseases. The 
experience from the Manning criteria was then used to 
develop the Rome Foundation criteria, with three differ- 
ent versions over the past 15 years (Rome I, II and III); 
the latest criteria, the Rome III criteria, was published in 
2006 (REFS 119,123,124). The updated Rome IV criteria 
are expected in May 2016. The sensitivity and specificity 
of the Rome criteria have been found to be 69-96% and 
72-85%, respectively, in different studies, but a problem 
with these studies is how to define the gold standard for 
an IBS diagnosis!”!. 

The common feature in all of these diagnostic cri- 
teria is abdominal pain and/or discomfort associated 
with abnormal bowel habit (diarrhoea (loose and fre- 
quent stools), constipation (hard and infrequent stools) 
or alternating constipation and diarrhoea). All of these 
criteria require a certain duration and frequency of 
the symptoms to fulfil the diagnostic criteria for IBS; 
that is, the symptoms should be chronic and recurring. 
Thus, the practical clinical use of the diagnostic criteria 
for IBS involves demonstrating through the clinical his- 
tory the presence of a combination of these symptoms for 
>3 days per month in the past 3 months, with symptom 
onset 26 months before the diagnosis (Rome III criteria). 
However, it should be noted that patients with some 
organic gastrointestinal disease also meet these diagnos- 
tic criteria’?° and, as such, the sensitivity and specificity 
of these criteria is suboptimal to distinguish the different 
disease entities!””'”°. 


Clinical features 

Besides the symptoms included in the diagnostic criteria, 
there are other clinical features that support a diagnosis 
of IBS, even though none of them is mandatory for an 
IBS diagnosis. One recent study found that variations 
in stool consistency and frequency or an unpredictable 
bowel pattern (‘irregularly irregular’) could be used to 
discriminate IBS-D adequately from organic gastro- 
intestinal disease’’’. Moreover, abnormal stool frequency 
(>3 bowel movements per day or <3 bowel move- 
ments per week), excessive straining during defaeca- 
tion, urgency (having to rush to the toilet), feelings of 
incomplete evacuation and mucus with bowel move- 
ments support an IBS diagnosis, but are nonspecific’. 
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Brain-gut 


The same is true for postprandial worsening or exacer- 
bation of symptoms, which is common in IBS’, but 
is also observed in other gastrointestinal diseases. The 
presence of other functional gastrointestinal diagnoses 
(such as functional dyspepsia)'”, as well as reporting 
numerous functional non-gastrointestinal symptoms 
and syndromes (such as chronic fatigue, fibromyalgia, 
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Figure 6 | Summary of the genetic findings associated with different 
pathophysiological mechanisms underlying IBS. Irritable bowel syndrome (IBS)-related 
pathways that are potential pharmacogenetic targets are marked in red when based on 
genetic findings and in blue when based on epigenetic findings; those in black are 
currently not seen as potential pharmacogenetic targets’. Various pathways might be 
affected in specific subgroups of patients with IBS: epithelial barrier (permeability), 
immune function, impaired bile acid metabolism and function, neuronal processing and 
signal transduction via spinal afferents from the periphery to the central nervous system in 
addition to the bidirectional crosstalk via the brain—gut axis, presumably contributing to 
psychological conditions such as anxiety, depression and somatization. Brain networks 
that have been associated with structural and functional alteration in IBS are depicted. 
ADRA, adrenoceptor-a; alNS, anterior insula; aMCC, anterior midcingulate cortex; 
CDC42, cell division cycle 42; CDH1, cadherin 1; CGN, cingulin; CLDN, claudin; 

COMT, catechol-O-methyltransferase; CRHR1, corticotropin-releasing hormone 
receptor 1; FGFR4, fibroblast growth factor receptor 4; GLUL, glutamate-ammonia ligase 
(also known as glutamine synthetase); GPBAR1, G protein-coupled bile acid receptor 1; 
GRID2IP, GRID2-interacting protein; HPA, hypothalamus-—pituitary—adrenal; 

HTR, 5-hydroxytryptamine receptor; hypo, hypothalamus; IL, interleukin; KLB, Klotho-B; 
LCC, locus coeruleus complex; mir, microRNA; mPFC, medial prefrontal cortex; 

NKRF, nuclear factor-«KB-repressing factor; NR3C1, nuclear receptor subfamily 3 group C 
member 1; NTS, solitary nucleus; OFC, orbitofrontal cortex; PAG, periaqueductal grey; 
PGR, progesterone receptor; SCN5A, sodium voltage-gated channel a-subunit 5; 
sgACC, subgenual anterior cingulate cortex; SLC6A4, solute carrier family 6 member 4; 
TNF, tumour necrosis factor; TNFSF15, TNF superfamily member 15; TRPV1, transient 
receptor potential cation channel subfamily V member 1. 


uro-gynaecological symptoms, muscle and joint pain 
and sleep disturbances)'"!*° and psychological comor- 
bidity (such as anxiety and depression)"’, are all 
common and support an IBS diagnosis. 


Physical examination 

A physical examination should be part of the evalu- 
ation to reassure patients and also to help exclude 
another organic cause of the symptoms. Admittedly, 
an abdominal examination, which is part of the routine 
examination, rarely discloses a specific diagnosis (that 
is, abdominal tenderness is present in various diseases), 
but the absence of objective findings on a physical 
examination has been found to support a diagnosis of 
IBS'”. A digital rectal examination is an important part 
of the physical examination and a useful tool to identify 
patients with dyssynergic defaecation, which is important 
to exclude in patients with constipation'’*'™ as well as to 
exclude rectal cancer. Perianal inspection should also be 
part of the examination to rule out perianal fistulas and 
other relevant anal pathology. 


Laboratory tests 

From the existing literature, it is not obvious which 
laboratory test to recommend in the diagnostic work-up 
of patients with IBS symptoms. Only serological tests for 
coeliac disease seem to be more likely to be abnormal in 
patients with symptoms compatible with IBS than in the 
general population’’’, even though a large multicentre 
trial failed to confirm this!’°. However, few studies have 
systematically evaluated the usefulness of laboratory tests 
in patients with potential IBS. A recent systematic review 
demonstrated that C-reactive protein (CRP) levels of 
<0.5 mg per dl or faecal calprotectin levels of <40 ug per g 
essentially exclude IBD in patients with IBS symptoms’”’. 
On the basis of the existing literature, it seems reasonable 
to perform a complete blood count and CRP measure- 
ment, as these are inexpensive and can be used to reassure 
the health-care provider and the patient. A thyroid profile 
can be included if the clinical suspicion of thyroid disease 
is high, a serological test for coeliac disease can be recom- 
mended in patients with non-constipated IBS and — if 
there is suspicion of an inflammatory process — a faecal 
calprotectin measurement can be added. Stool analyses 
to detect gastrointestinal infections can be considered if 
diarrhoea is predominant and difficult to treat, especially 
in regions where infectious diarrhoea is common’”*. As 
stated previously, there is currently no valid diagnostic 
biomarker, even though preliminary data have suggested 
that certain biomarkers or biomarker assays (BOX 3) for 
clinical use might prove to be valid following further 
scientific investigation’?!”. 


Alarm features 

Alarm features for IBS are symptoms that should raise 
the clinical concern of another gastrointestinal disease 
rather than IBS. Whether the use of alarm features (BOX 6) 
improves the performance of diagnostic criteria for IBS 
is not totally clear!”>'*!. However, from a clinical point of 
view, it seems reasonable to use these to select patients 
for further diagnostic testing, even though these may be 


12| 2016 | VOLUME 2 


www.nature.com/nrdp 


© 2016 Macmillan Publishers Limited. All rights reserved. 


Initial evaluation 


e Diagnostic criteria for IBS* 

e Other clinical features 

e Alarm symptoms present? 

e Physical examination 

e Routine laboratory tests (CBC, CRP and serological test for coeliac disease) 
Consider: thyroid profile, faecal calprotectin and stool analyses based 
on clinical presentation 


a a ——————— ar: 


e Positive symptom-based 


e Alarm features 
e Abnormal laboratory tests 
or physical examination 


e Severe, refractory symptoms 


diagnostic criteria for IBS 

e Other clinical features 
supporting a diagnosis of IBS 

e No alarm symptoms 

e Normal physical examination 
and laboratory tests 


e Predominant symptom 
e Type of alarm symptom 


finding 


q 


Further investigations based on: 


e Abnormal laboratory test or physical 


e Make a confident IBS diagnosis 

e Reassure 

e Explain 

e Treat according to the predominant 
symptom 


Figure 7 | A diagnostic algorithm for patients with IBS. This diagram gives a schematic 
overview of the sequential approach to irritable bowel syndrome (IBS) diagnosis'**. 
CBC, complete blood count; CRP C-reactive protein. Figure from REF. 144, 


Nature Publishing Group. 


present in a substantial proportion of patients without 
indicating a serious underlying condition in the gastro- 
intestinal tract’. Alarm symptoms can necessitate fur- 
ther investigations to rule out another gastrointestinal 
disease before an IBS diagnosis can be recommended. 
Moreover, the predominance of diarrhoea, especially 
when watery and frequent, should alert the clinician to 
consider alternative diagnoses’. 


Invasive investigations 

In the majority of patients with symptoms compatible 
with IBS and normal routine laboratory tests but without 
alarm features’, no additional invasive investigations are 
needed and, importantly, performing investigations does 
not seem to improve patient satisfaction or QOL'*"*. 

Colonoscopy should be performed when alarm fea- 
tures prompt an investigation and when there is suspicion 
of an inflammatory condition in the gastrointestinal tract 
based on history or laboratory parameters (increased 
CRP or faecal calprotectin levels)'*’, or based on the indi- 
cations for colorectal cancer screening in countries with 
population screening programmes'*”"*’. 

When the patient complains of watery diarrhoea as 
the predominant symptom, a colonoscopy with biopsies 
should also be considered to rule out microscopic colitis, 
especially in women >50 years of age'**!*. Moreover, bile 
acid-induced diarrhoea has recently been found to be a 
very important differential diagnosis in patients with IBS 
symptoms with frequent, loose stools*”’, and a diagnos- 
tic test should be considered (75-homocholic acid taurine 
(”SeHCAT) test or serum C4 levels)'*°. Unfortunately, 
these tests are not available in all centres, therefore a 
therapeutic trial with a bile acid-binding agent is often 
used as an indirect, but far from perfect, assessment of 
bile acid-induced diarrhoea. 
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Carbohydrate malabsorption is another differential 
diagnosis in patients with IBS-D'""’, and lactose or 
fructose hydrogen breath tests can be considered’*!”, 
but a trial period with dietary exclusion of the suspected 
carbohydrate for several weeks is often used instead. 

If coeliac disease is suspected, based on a positive 
serological test or the clinical history, an upper gastro- 
intestinal endoscopy with duodenal biopsies should be 
performed. Small intestinal bacterial overgrowth has 
been proposed to be common in IBS, but its preva- 
lence and clinical importance is uncertain, therefore 
routine clinical testing for this cannot be advocated’"*"”, 
especially as valid tests with adequate sensitivity and 
specificity are lacking. 


Management 

Only a fraction of patients with IBS-like symptoms 
(~50%) seek medical care!**. Most of these patients 
will initially consult primary care physicians for their 
symptoms, and the factors that drive this consultation 
are symptom severity, especially pain, the occurrence 
of alarm symptoms (BOX 6) and concerns that symp- 
toms might indicate an underlying severe disease — for 
example, cancer’’. Therefore, in many cases, gastro- 
intestinal specialist care is needed to exclude diseases 
that can mimic IBS symptoms — for example, by 
endoscopy. Once a positive diagnosis of IBS has been 
established, clinical management can be carried out 
as well by primary care physicians and at substantially 
lower costs'®. 

Management of IBS involves an integrated approach, 
including the establishment of an effective patient- 
provider relationship, education, reassurance, diet- 
ary alterations, pharmacotherapy and behavioural 
and psychological treatment'*'. Owing to the fact that 
~50-70% of patients with IBS report additional somatic 
and psychological symptoms when they are asked'*!”, 
a stepped-care approach including aspects of cogni- 
tive and interpersonal therapy is most appropriate”. 
The initial treatment strategy should be based on pre- 
dominant symptoms and includes antispasmodics for 
abdominal pain, antidiarrhoeals for IBS-D and lax- 
atives for IBS-C, whereas nutritional interventions and 
psychotherapy can be used in all subtypes. 


Nutrition 

Food ingestion is one of the most commonly reported 
factors that results in the exacerbation of symptoms 
among patients with IBS’. Postprandial symp- 
toms per se and fear of their occurrence (anticipatory 
anxiety) contribute profoundly to reduced QOL in 
IBS'’*. Up until recently, food-related symptoms had 
received scant attention from clinical scientists, leav- 
ing patients to find their own way through the plethora 
of usually non-validated and untested diagnostic tests 
and dietary regimens, which could result in clinically 
relevant nutritional deficits'’®. 

It has become evident that food intolerance (a physio- 
logical reaction to food allergens that is not associated 
with an immune response), and not classical IgE- 
mediated food allergy (which involves activation of the 
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Box 6| Alarm features for IBS 


e Unintended weight loss (>10% in 3 months) 


e Presence of blood in the stools not caused by haemorrhoids or anal fissures 


e Symptoms that awaken the patient in the night 


e Fever in association with the bowel symptoms 


e Family history of colorectal cancer, inflammatory bowel disease or coeliac disease 


e New onset of irritable bowel syndrome (IBS) symptoms after 50 years of age 


immune system), is the major mechanism responsible 
for symptomatic responses to certain foods'®. This is not 
to say that immune responses to food or food compo- 
nents are irrelevant for IBS. For example, one study 
demonstrated that exposure of the small intestine to cer- 
tain food antigens led to subtle ultrastructural changes 
in the duodenal mucosa of patients with IBS, but not in 
controls*!. Another study also reported local immune 
responses to gluten among a group of non-coeliac 
patients with IBS'®. Taken together, these observations 
leave the door open to the possibility that at least some 
patients with IBS may mount an, as yet to be defined, 
immunological response to certain dietary components, 
a response that seems to be confined to the mucosal 
immune system. 

How does one explain food-related symptoms in IBS? 
Given the primacy of food ingestion as a stimulus to most 
gastrointestinal functions, postprandial pain and rectal 
urgency in IBS could simply reflect an exaggeration 
of a normal physiological phenomenon. Exaggerated 
motor responses to food and, especially to lipids, have 
also been demonstrated in the small intestine in IBS'%”. 
Furthermore, tryptophan, the 5-HT precursor, and 
related compounds present in some foods could modu- 
late psychological comorbidities and gastrointestinal 
symptoms in IBS'®. Food-related symptoms could also 
be mediated through interactions between our diet, the 
products of digestion and the gut microbiota. Products 
of bacterial metabolism, such as deconjugated bile salts, 
SCFAs and gases, could exert potent effects on colonic 
physiology and thereby induce symptoms. 

Although patients with IBS readily incriminate 
specific food items as those that are especially likely to 
precipitate symptoms, only 11-27% of those are correctly 
identified when confirmed in formal, blinded food chal- 
lenge studies'’”. The limitations of dietary surveys and 
the poor reproducibility of reported food intolerances 
notwithstanding, some food items are reported as being 
more problematic: wheat, fruit and vegetables’. Current 
enthusiasm for diets low in FODMAPs is consistent with 
these observations. 

Fibre and fibre-based supplements accelerate colon 
transit, increase stool bulk and facilitate its passage, 
resulting in an increase in stool frequency. These 
effects translate into clinically meaningful benefits for 
people with chronic constipation and IBS-C. Indeed, 
fibre and products based on synthetic fibre-like sub- 
stances became a cornerstone in the management 
of IBS. However, RCTs found that not all patients 
gained relief and some even complained of exacer- 
bation of their symptoms (including pain, bloating 


and distension). Recent meta-analyses and systematic 
reviews have shed some light on this issue by showing 
that fibres are heterogeneous and the consumption of 
soluble fibres such as psyllium, calcium polycarbophil 
and ispaghula bring symptomatic benefits, whereas 
insoluble fibres, represented by bran, are ineffective in 
patients with IBS'®. 

Interest in the use of lbw FODMAP diets (BOX 5) in 
patients with IBS is increasing. RCTs have confirmed 
some beneficial effects of low FODMAP diets on IBS 
symptoms’”’, but they were not superior to conven- 
tional dietary advice when directly compared’. There 
are some limitations; studies to date have been small 
and, as has been the case with many studies of dietary 
interventions in IBS, suffer from some methodological 
limitations'”. Furthermore, low FODMAP diets are 
complex, may require supervision by a qualified diet- 
ician and involve the elimination of many food items 
commonly regarded as components of a ‘healthy’ 
diet. Some initial investigations suggest that the low 
FODMAP diet may suppress the growth of bacterial 
species commonly regarded as important components 
of healthy microbiota, such as bifidobacteria’. Included 
in the FODMAP category are some molecules, such as 
lactose, fructose and sorbitol; some patients with IBS 
may benefit from the removal of one of these substances 
alone’”’. Predicting responders is difficult, as commonly 
used challenge tests, such as the lactose or fructose 
breath hydrogen test, do not seem to be of value’”>!”°. 

The concept of ‘non-coeliac gluten sensitivity’ has 
been advanced to explain instances of IBS-type symp- 
toms that develop in individuals who do not satisfy diag- 
nostic criteria for the diagnosis of coeliac disease (that 
is, positive serology and appropriate changes in small 
intestinal morphology)'”’. This remains an unsettled 
and contentious issue with some studies reporting that, 
when tested in a blinded manner, gluten did induce 
the usual IBS symptoms in some patients with IBS!”*. 
Others argue that gluten contributes little to IBS symp- 
tomatology, but that fructans (FODMAPs contained 
in wheat), and not gluten, are the culprits of wheat- 
related problems. Results of clinical trials assessing the 
role of gluten exposure in IBS pathology have therefore, 
not surprisingly, yielded mixed results’”'*. Although 
gluten-free diets are currently enjoying considerable 
popularity among patients with IBS and the population 
at large in the United States, the rationale for gluten 
exclusion in IBS has yet to be firmly established. 

Patients with IBS commonly consume any one or 
combinations of a wide variety of dietary supplements 
ranging from vitamins to ‘digestive enzymes, anti- 
oxidants and essential oils. Few, if any, of these have 
been subjected to rigorous study. Prebiotics (non- 
digestible food ingredients that beneficially affect the 
host by selectively stimulating the growth and activity 
of one species or a limited number of species of bac- 
teria in the colon) and probiotics (live microbial food 
ingredients that alter the microflora and confer health 
benefit) have also been used for decades in IBS in the 
absence of supportive data. Prebiotics and probiotics are 
now subjected to more-rigorous studies, as they might 
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contribute to altered microbiota in IBS'*''**. Although 
these studies must be interpreted with care, a recent 
meta-analysis does suggest efficacy for probiotics (as a 
category) in IBS'**. However, high-quality RCTs remain 
few in number and available data provide scant infor- 
mation to assist the consumer in choosing a particular 
product to alleviate symptoms™ or to make a recom- 
mendation on prebiotics or synbiotics (a combination 
of a prebiotic and a probiotic) in IBS’®. 
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Figure 8 | Mechanisms of action of different drugs used for the treatment of IBS. 
Drugs currently used for the treatment of irritable bowel syndrome (IBS) (orange boxes) 
target nerve activity, epithelial functions or the contractile state of the smooth muscle 
layers. Several drugs act by enhancing the activity of chloride channels to increase fluid 
secretion into the intestinal lumen as a consequence. Other mechanisms of action 
include modulation of visceral sensitivity at a central or peripheral level. Finally, drugs act 
to modulate signal transduction at the neuromuscular junction or alter motility by direct 
myogenic actions. The plus and minus symbols indicate whether a mediator activates or 
inhibits its target cell; those in parentheses denote actions established in animal models 
and those without parentheses are effects demonstrated in humans (human tissue). 
5-HT, 5-hydroxytryptamine (also known as serotonin); ACh, acetylcholine; CFTR, cystic 
fibrosis transmembrane conductance regulator; ClC2, chloride channel protein 2; 
GC-C, guanylyl cyclase C; VIP, vasoactive intestinal peptide. 
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Drug therapy 

Broadly speaking, the current therapeutic armamentar- 
ium in IBS aims to alter predominant problematic bowel 
habits and/or visceral pain. However, an emerging area 
is manipulation of the gastrointestinal microbiota. 


Antispasmodic drugs. Pain in IBS is mediated through 
central and peripheral mechanisms, and is in part the 
result of smooth muscle spasms. The mode of action of 
antispasmodic drugs is probably their ability to antago- 
nize the binding of acetylcholine to the muscarinic 
receptor at the neuromuscular junction, with smooth 
muscle relaxation as a consequence’**. Some studies 
have demonstrated a beneficial effect of otilonium bro- 
mide and hyoscine over placebo, with a number needed 
to treat (NNT) of four patients'’’. An adverse effect of 
anti-muscarinic agents is constipation because of their 
strong inhibition of intraluminal fluid secretion’. 
Accordingly, these drugs are best used in patients with- 
out constipation and should be taken 20 minutes before 
meals to ease postprandial symptoms. Peppermint oil, 
which also inhibits smooth muscle contraction albeit 
by calcium channel blockade, is beneficial in reducing 
IBS symptoms'**. A recent RCT in patients with IBS-D 
and IBS-M demonstrated that a novel formulation of 
peppermint oil, designed to cause a sustained release 
within the small bowel, was superior to placebo in 
causing a reduction in total symptoms’. 


Low-dose antidepressants. Antidepressants, such as 
tricyclic antidepressants (TCAs) or selective sero- 
tonin reuptake inhibitors (SSRIs), are recommended 
by existing guidelines for the treatment of pain in 
patients who are refractory to antispasmodics and 
dietary alterations’”’. However, these drugs are not 
licensed anywhere in the world for the treatment of 
patients with IBS, and their use is off-label. Given 
the lack of licensed indication, the rationale for using 
such drugs should be discussed in detail with patients. 
The exact analgesic mechanism of action of low-dose 
antidepressants is incompletely understood but is 
considered to be both peripheral, via alterations of 
histaminergic and/or cholinergic transmission within 
the gastrointestinal tract, and central, via modula- 
tion of both ascending visceral sensory afferents and 
central transmission’’'. SSRIs are generally well toler- 
ated. Adverse effects such as constipation, dry mouth, 
drowsiness and fatigue are reported with TCAs. TCAs 
may be particularly effective for treating pain in 
patients with IBS-D, but are less suitable for patients 
who have IBS-C. 


Laxatives and motility accelerants. In those with con- 
stipation, simple laxatives such as senna and docusate 
are often effective in managing symptoms. However, 
the use of lactulose is not recommended as it is often 
poorly tolerated by patients with IBS because of wors- 
ening of bloating and discomfort. Linaclotide, a min- 
imally absorbed guanylyl cyclase C agonist peptide 
(FIG. 8), can be used as second-line therapy after lax- 
atives have failed in patients with IBS-C and symptoms 
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have lasted for >1 year. Linaclotide has a dual action 
through increasing intraluminal fluid secretion thereby 
giving its laxation effect but also an analgesic effect via 
modulation of colonic nociceptors'”’, and its effects 
caused reduced abdominal pain, bloating and bowel 
symptoms in two well-designed Phase III RCTs!?*"*. 
Lubiprostone, a minimally absorbed, locally active, 
bicyclic fatty acid derivative of prostaglandin E1, acti- 
vates type 2 chloride channels on the enterocyctic 
apical membrane, thereby stimulating fluid secretion. 
Lubiprostone has been shown to improve global intes- 
tinal symptoms in IBS-C’”. 5-HT4 receptor agonists 
(such as prucalopride), which promote gut motility 
through the activation of the serotoninergic pathways, 
have been shown to be effective in increasing com- 
plete spontaneous bowel movements in patients with 
chronic constipation’”’. 


Antidiarrhoeals. The u-opioid receptor agonist lopera- 
mide is frequently used as a first-line agent in IBS-D 
and improves diarrhoea by inducing peristalsis, which 
prolongs the gastrointestinal transit time. As lopera- 
mide does not cross the blood-brain barrier, central 
adverse effects are limited. Its main benefit is reducing 
stool frequency and defaecation urgency, and improv- 
ing the consistency of the stool'*’”. Eluxadoline, a mixed 
u-opioid receptor agonist and 6-opioid receptor antago- 
nist, has been evaluated in a Phase III RCT, although 
safety concerns have been expressed concerning the 
excess rates of pancreatitis’”®. 

5-HT3 receptor antagonists, such as alosetron, 
ramosetron and ondansetron, are effective in the 
management of IBS-D symptoms. The mechanism of 
action of 5-HT3 receptor antagonists is complex and 
incompletely understood, but is considered to pro- 
ceed through inhibition of the ascending excitatory 
component of the peristaltic reflex and of the high 
amplitude propagating contractions within the gastro- 
intestinal tract!”’. However, a central effect of 5-HT3 
receptor antagonists on pain cannot be excluded”. 
Safety concerns, with respect to ischaemic colitis, have 
been confined to alosetron, which subsequently led to 
restrictions in its prescription”. Consequently, other 
5-HT3 receptor antagonists have been investigated, with 
ondansetron*” and ramosetron demonstrating efficacy 
in RCTs”. 


Manipulation of the microbiota. Given the burgeon- 
ing evidence of the role of the microbiota in IBS, 
both antibiotics and probiotics have been evaluated. 
The non-absorbable antibiotic, rifaximin, has been 
demonstrated to cause a reduction in symptoms, 
with a NNT of approximately 11 patients, although 
it is not clear whether repeated courses of treatment 
are needed. The mechanisms by which rifaximin 
exerts its positive effects on IBS symptoms are incom- 
pletely understood and may include modulation of 
the gut microbiota, but also direct effects on local 
micro-inflammation. Rifaximin is approved for use in 
the United States, but has not yet received regulatory 
approval in Europe. Probiotics can reduce pain and 


symptom severity, although recent meta-analyses have 
highlighted that inconsistencies in study design render 
definitive recommendations problematic'®*'**; again, 
it is unclear whether probiotics act on IBS symptoms 
through direct modulation of the microbiota, indirect 
via the gut immune system or otherwise. 


Others. A proportion of patients use herbal supplements 
either as single herbs or in combination. Four weeks of 
treatment with iberogast, which is a mixture of nine 
plant extracts, improved abdominal pain and QOL ina 
double-blind RCT of 208 patients with all types of IBS”. 
Although the mechanism of action is poorly understood, 
it is probably multifaceted via acetylcholine, 5-HT and 
opioid receptors in the gastrointestinal tract*’”. Although 
herbal remedies represent a promising intervention, fur- 
ther rigorously designed larger RCTs in the subtypes of 
IBS are needed. 


Psychotherapy 

The biopsychosocial model of IBS suggests that 
abdominal symptoms secondarily influence anxiety 
and depression (bottom-up) and psychosocial factors 
influence physiological factors, such as motor func- 
tion, sensory threshold and stress reactivity of the gut 
(top-down)”™. 

Treatment concepts that target these psycho- 
social factors of patients with IBS should be based on 
evidence-based models that take the following three 
components into account: altered peripheral regula- 
tion of gut function, altered brain-gut signalling and 
reducing psychological distress, including general 
hypervigilance and a general mindset of catastro- 
phizing*”. Such models might be helpful as a basis of 
patient education and a target for effective treatments. 
To further improve treatment programmes, we have to 
learn more about IBS-specific interactions and the role 
of stress and visceral sensitivity for clearer evidence 
on which group of patients might benefit from which 
treatment approach. In addition, it should be noted 
that patients with IBS often experience additional 
functional symptoms, pointing to the complexity of 
the condition”. 

The effect of IBS symptoms on patients’ feelings of 
shame, fearfulness and embarrassment is well estab- 
lished; patients report being poorly understood by 
their physicians, as well as by their family members and 
friends*'’. Patients who experience a positive therapeu- 
tic physician-patient relationship have fewer IBS-related 
follow-up visits”"’. 

International treatment guidelines for IBS have 
advocated for a graded treatment approach’!”*!’. The 
National Institute of Health and Care Excellence 
(NICE) guidelines advise that patients whose symp- 
toms do not respond to pharmacological treatments 
after 12 months and who develop a continuing symp- 
tom profile (refractory IBS) should be considered 
for referral to cognitive—behavioural therapy (CBT), 
hypnotherapy (gut-directed hypnosis) or other psycho- 
logical therapy, such as psychodynamic (interpersonal) 
therapy and mindfulness-based therapy’”’. 
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BOX 7 describes the four major psychological-based 
therapies for patients with IBS. Several meta-analyses 
have been performed in the field of psychological and 
behavioural therapies (including studies in stress reduc- 
tion and relaxation) that took 45 RCTs into account with 
a total of 3,325 patients with IBS of all subtypes (TABLE 1). 
Overall, the NNT for psychological therapies is four 
patients (95% CI: 3-5) and, therefore, better than the 
majority of drugs”. In a stepped-care approach (begin- 
ning with the least intensive or invasive treatment and 
stepping up or down depending on the needs of the 
patients), a psychology-based self-aid (educational) 
approach has been shown recently in a meta-analysis as 
an effective treatment option for all subtypes of IBS”*”. 
Compared with control treatments, a medium effect size 
was demonstrated on decreased symptom severity and a 
large effect size on increased patient’s QOL. 

The best evidence is available for CBT. Although 
CBT is not routinely available in primary care, it can be 
accessed in some local hospitals and health-care systems. 
There are medium-to-large significant pooled effect 
sizes for an improvement of IBS symptoms using CBT 
with a medium significant pooled effect size for QOL 
and a small-to-medium pooled effect size for psycho- 
logical comorbid symptoms. The NNT for CBT is only 
three patients, with a limited variance between the RCTs. 
Nevertheless, to date there is no evidence of a superiority 
of CBT compared with other psychological treatments 


in IBS. 


Box 7 | Evidence-based psychological treatments 


Cognitive—behavioural therapy 

Cognitive—behavioural therapy (CBT) is based on the assumption that irritable bowel 
syndrome (IBS) symptoms are a response to stressful life events, maladaptive 
behaviour and an inappropriate attribution of symptoms. CBT aims to modify these 
behaviours and thoughts through education, which consists of the explanation of IBS 
symptoms and the CBT model, and by identification of the psychological factors that 
are interacting with their physical symptoms. On the basis of these findings, patients 
and therapists work together to identify the potential associations between IBS 
symptoms and their thoughts, emotions and actions. Finally, behavioural therapy 

(for example, stress management) is applied””. 


Psychodynamic (interpersonal) therapy 

Psychodynamic (interpersonal) therapy (PIT) aims to obtain insights into symptom 
development as a consequence of interpersonal conflicts or difficulties in relationships 
with key people. Patients are encouraged to discuss their symptoms in depth, 
emotional factors are explored and links between symptoms and emotional factors 

are identified*”*. 


Gut-directed hypnosis 

In gut-directed hypnosis (GDH), as opposed to standard hypnotherapy, suggestions 
are made on how to control and normalize gastrointestinal function and metaphors are 
used to bring about improvement. GDH differs from other forms of psychological 
treatment in which therapy is provided to patients in a conscious state. After 
information on the effects of hypnosis is given, participants are provided with a 
compact disk (created by hypnotherapists) for practicing at home on a daily basis”*’. 


Mindfulness-based therapy 

Mindfulness-based therapy (MBT) for IBS has been adapted from the mindfulness-based 
stress reduction programme. The basic course emphasizes the relevance of mindfulness 
in coping with IBS-related symptoms and perceptions. With a range of behavioural and 
cognitive techniques, MBT promotes sensory versus emotional processing of 
interoceptive signals and counteracts catastrophizing as a maladaptive cognitive 
coping style*». 
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Validation of psychodynamic (interpersonal) ther- 
apy, gut-directed hypnosis and mindfulness-based 
therapy (BOX 7) has only been done in a very limited 
number of tertiary treatment centres and the general- 
ization of these treatment approaches is limited. Finally, 
mindfulness-based therapy for IBS shows some prom- 
ising initial results, particularly in the subgroup of 
female patients with IBS”'*. Very limited data on multi- 
component therapies and on the combination of anti- 
depressants and psychological treatments are available’®. 
Overall, there is a lack of reports of adverse effects of 
psychological and behavioural treatment approaches and 
treatment resistance in patients with IBS. Psychological 
therapies have also regularly not distinguished between 
IBS subtypes and, thus, might have missed differential 
indications and advantages and disadvantages. 


Quality of life 

In the field of medicine, general QOL and disease-specific 
QOL are distinguished. General QOL is a measure of the 
entire health perception ofa person. Representative gen- 
eral QOL can be assessed using the Medical Outcome 
Study 36-item Short-Form Health Survey (SF-36)*’° or 
the EuroQOL survey”””. SF-36 is the most popular instru- 
ment that can evaluate physical functioning, physical 
role, bodily pain, general health perceptions, vitality, 
social functioning, emotional role and mental health’"®. 
Disease-specific QOL is a measure of life disturbance that 
is specifically caused by the disease”'*’)’. 

QOL in patients with IBS is greatly disturbed. 
Patients with IBS showed impaired general QOL with 
lower values on all SF-36 subscales except physical func- 
tioning than healthy controls in one study*”’, whereas 
lower values on the SF-36 subscales in patients with IBS 
(except physical functioning, physical role and emo- 
tional role) than in healthy controls were observed in 
another study". All subscales of SF-36, except the physi- 
cal functioning and physical role domains, were lower in 
patients with IBS than in healthy controls regardless of 
culture’. The degree of disturbance of general QOL in 
patients with IBS has been shown to be worse in several 
subcategories than in those with gastroesophageal reflux 
disease, diabetes mellitus or severe chronic kidney dis- 
ease’’’. Finally, a study has shown that patients with IBS 
had more disturbed general QOL in physical role, bodily 
pain, general health perceptions and social functioning 
than non-consulters with IBS (individuals who do not 
seek treatment)". 

QOL seems to be the same among IBS subtypes. 
However, disease-specific QOL, as measured with the 
IBS-QOL in patients with IBS-D or IBS-M, was worse 
than in patients with IBS-C in one study’. In this study, 
increased food avoidance in patients with IBS-D and 
IBS-M may have been responsible for the lower QOL”, 
but there are controversial reports”"®. 

In severe IBS, both gastrointestinal symptoms and 
psychiatric comorbidity independently contribute to 
disturbed QOL” (FIG. 9). Another study revealed that 
the QOL of patients with IBS was more influenced 
by the extraintestinal symptoms — such as tiring 
easily, low in energy, the feeling that there is something 
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seriously wrong with their body, feeling tense, feeling 
nervous, feeling hopeless, difficulty sleeping and low 
sexual interest — than by gastrointestinal symptoms™. 
The psychological and psychosocial dimensions of food 
ingestion might also have a role. Eating with family and 
friends is probably the most common form of social 
interaction worldwide. An inability to participate in 
such a fundamental component of social intercourse 
because of a fear of pain, urgency, diarrhoea or disten- 
sion occurring during or immediately after a meal can 
be devastating and can result in social isolation’"®. 
Systematic reviews have clarified that improvement 
of IBS-related pain by treatment results in better QOL in 
patients with IBS’. The disease-specific IBS-QOL and 
IBS-QOL questionnaires can measure the efficacy of 
treatment, especially long-term therapies”. Although 
the SF-36 can also detect the efficacy of long-term treat- 
ment (>1 year), it is less sensitive than the IBS-QOL. 
Both measures struggle to detect drug or psychotherapy 
efficacy in the short term (<1 month)’, but IBS- 
QOL is sensitive enough to detect efficacy for mid-term 


(3 months) treatment’”. A therapeutic gain of 214 points 
in IBS-QOL denotes a clinically meaningful change. 
Even if primary end points based on cardinal symptoms 
of IBS are similar between treatments, a treatment result- 
ing in better QOL may be preferred by patients over 
another treatment that does not improve QOL. 


Outlook 

The field of research into IBS has expanded consider- 
ably over the past decade with many new studies, in part 
driven by the development of new therapeutic agents. 
This trajectory seems likely to continue as patients with 
IBS account for a substantial proportion of all gastro- 
intestinal consultations, and many questions in the field 
remain unanswered (BOX 8). 


Patient stratification and biomarkers 

Many classes of drugs have been evaluated by RCTs in 
IBS and these have often produced disappointingly small 
differences from placebo'*’”!*”**, These small differences 
conceal the fact that some patients benefit from the drugs. 


Table 1 | Evidence-based psychological treatments for IBS 


Comments 


e CBT was superior to waiting lists, basic 
support or medical treatment alone at 
the end of treatment but not superior 
to other psychological treatments 


e PIT is less well standardized in terms 
of its performance (that is, duration, 
setting and phases) 


Psychological nofstudies Main findings 
treatment (n of 
approach* participants) 
CB 18 RCTs e Symptom score: medium-to-large significant pooled effect size* (0.67) 
(1,380) e QOL: medium significant pooled effect size (0.48) 
e Psychological distress (depression and anxiety): small-to-medium 
pooled effect size (0.21) 
e NNT for CBT was 3 (95% Cl: 2-6) 
Pia 2 RETs (273) Both studies compared PIT with ‘supportive listening’ applied by the 
same therapist. Compared with controls: 
e PIT significantly improved symptoms 
e PIT showed a large cost-effectiveness 
e PIT was widely acceptable 
e PIT significantly improved QOL 
e PIT significantly reduced costs 
e The calculated OR for benefit was 2.92 (95% Cl: 1.76-4.83) 
e NNT for dynamic psychotherapy was 3.5 (95% Cl: 2-25) 
GIDIale" 7 RCTs (452) ¢ 6 of 7RCTs reported a significant reduction (all P< 0.05) in overall 
gastrointestinal symptoms compared with supportive therapy only 
e Response rates ranged between 24% and 73% 
e Efficacy was maintained long term in four of five studies 
e NNT was 4 (95% Cl: 3-8) 
MBT??> 2 RCTs (79) e Women showed greater reductions of symptoms compared with a 


Relaxation’ = 6RCTs(255) 


GSiHs?? 10 RCTs (886) 


control group immediately after training (26.4% versus 6.2%; P=0.006) 
and at 3 months follow-up (38.2% compared with 11.8%; P=0.001) 

e Changes in QOL, distress and anxiety were not different between 
groups immediately after treatment 

e Significantly greater improvement in the MBT group than in the 
control group evident at 3 months follow-up 

e The beneficial effects persisted for >3 months 


e Overall, no benefit of relaxation training or therapy in IBS was 
detected in the RCTs 


e Compared with control conditions, a moderate effect size on 
symptom severity (0.72) and a large effect size on the increase of 
patients’ QOL (0.84) was found 


e Very few professionals are trained for 
the specific implementation of GDH 
and therefore their services can be 
difficult to access 

e The mechanisms by which GDH exerts 
its effect are poorly understood 


e In another RCT, the IBS symptom 
severity in the mindfulness-based 
stress reduction group was not 
retained at 6 months follow-up 


e The field of studies on relaxation 
techniques is diverse 


e GSHs might be an easily accessible and 
a cost-effective treatment alternative. 
However, there is a wide heterogeneity 
and variance in its performance 


The NNT data are based on Ford et al.’!*. CBT, cognitive—-behavioural therapy; GDH, gut-directed hypnosis; GSH, guided self-help intervention; IBS, irritable bowel 
syndrome; MBIT, mindfulness-based therapy; NNT, number needed to treat; OR, odds ratio; PIT, psychodynamic (interpersonal) therapy; QOL, quality of life; 

RCT, randomized controlled trial. *See REF. 245. *Effect size (for example, Cohen’s d): effect sizes of 0.2-0.5 are regarded as small, between 0.5 and 0.8 as moderate 
and >0.8 as large. Methods and techniques applied are progressive muscle relaxation, biofeedback and transcendental or yoga meditations. 
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Figure 9 | Concept of multifactorial quality-of-life effects in IBS. The genome and 
epigenome partially determine (‘filter’) the response of an individual to external stressors 
(psychosocial factors) and internal stressors (ingested food or microbiota). These, 
together with social support, appraisal, emotion and coping behaviours against stressors, 
determine the stress response affecting the brain—gut interactions. This response might 
involve regional brain activation, changes in autonomic and neuroendocrine function, 
which might lead to many of the clinical manifestations observed in irritable bowel 
syndrome (IBS), including visceral hypersensitivity, alteration in gastrointestinal motility, 
increased mucosal permeability and low-grade inflammation. These gastrointestinal 
symptoms and other extra-intestinal manifestations (such as multiple somatic symptoms 
and psychiatric comorbidities) impair the quality of life (QOL) of patients with IBS. 


Proper stratification of patients by relevant underlying 
disease mechanism has been an issue, therefore many 
trials use unselected patients with IBS, independent of 
the underlying disease mechanisms and clinical presenta- 
tions. The use of 5-HT receptor-modulating drugs has 
taught the research community that restricting 5-HT3 
receptor antagonists to patients without constipation 
improved their effectiveness with significant differences 
from placebo”, owing to the fact that 5-HT3 recep- 
tor antagonists slow transit and aggravate constipation. 
However, RCTs rarely measure transit as a requirement 
for trial entry, which depends on symptoms recorded in 
daily symptom diaries. The use of more-objective bio- 
markers to select patients for RCTs would be expected to 
improve the effect size and reduce the number needed 
to test to show a significant difference from placebo. 
The lack of reproducible, widely available biomark- 
ers that reflect the targets of ‘older’ drugs has been a 
considerable limitation. Antispasmodics are a good 
example of such drugs that have fallen out of favour 
because we cannot reliably identify those with excessive 
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motor activity who might be expected to respond. 
Future novel non-invasive motility assessments, such as 
MRI**!, capsule endoscopy”” and the pressure-sensitive, 
temperature-sensitive and pH-sensitive SmartPill 
(Given Imaging Ltd, Yoqneam, Israel)”*? (which can 
measure intestinal contractions), hold the possibility of 
identifying such patients in the future. 

Although individual genetic markers seem likely to 
be associated with only quite modest increases in risk 
for IBS, they might be important predictors of drug sen- 
sitivity in particular pathways. 5-HT3 receptor antago- 
nists are good examples of drugs with a wide range of 
sensitivities such that effective doses for one patient 
can produce unacceptable constipation in another. 
This finding may be due to a combination of important 
functional polymorphisms in genes involved in 5-HT 
synthesis (tryptophan hydroxylase 1 (T'PH1)), those 
involved in 5-HT reuptake via the 5-HT transporter 
(SLC6A4) and polymorphisms in the 5-HT3 receptor 
genes (which alter sensitivity). Several small studies 
have suggested significant differences in responder 
status to one 5-HT3 receptor antagonist, ramosetron, 
according to polymorphisms in TPH] (REF. 234) and to 
another 5-HT3 receptor antagonist, alosetron, accord- 
ing to polymorphisms in SLC6A4 (REF. 235). However, 
these studies are underpowered and have not yet been 
reproduced”. By analogy with other complex dis- 
orders”’®, the effect of any one individual polymorphism 
may be limited but combining polymorphisms that pre- 
dict low 5-HT production with rapid uptake and low 
receptor sensitivity would be expected to be associated 
with higher odds ratios for success of 5-HT manipula- 
tion. Future studies should be powered to examine this 
notion such that the dose can be tailored to individ- 
ual patients. Similarly, polymorphisms in the FGF19- 
FGFR4 pathway, which controls bile acid synthesis!°”'’, 
influence colonic transit and should be explored to see 
if different combinations alter sensitivity to bile acid 
sequestrants or bile acid transporter inhibitors. 


Mode of action of food intolerances 

Dietary restrictions such as low FODMAP diets (BOX 5) 
are another example in which implementation of an 
effective treatment is hampered by lack of biomarkers 
to predict response or reliably identify the key com- 
ponent (or components) of food that are responsible 
for symptoms. Although poorly absorbed fermentable 
carbohydrates can undoubtedly cause symptoms in 
some patients, visceral sensitivity is the key to why some 
individuals experience symptoms and some do not”, 
at least in the case of lactose malabsorption. However, 
no trial of lactose exclusion in IBS has used measures 
of sensitivity to stratify patients. While rectal barostat 
tests to assess visceral sensitivity are difficult, although 
not impossible to standardize across centres, alternatives 
might be to use simple cutaneous pressure or thermal 
stimulation’’*. More remotely, somatization question- 
naires concerning non-gastrointestinal symptoms 
such as headache, backache, dyspnoea and palpitations 
have been shown to correlate, albeit weakly, with rectal 
distension pressure thresholds for pain”. 
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Box 8 | Key questions to be addressed in future research 


Can we develop clinically applicable biomarkers to stratify patients to disease 
mechanisms, thereby reducing the number of patients needed to evaluate new 
therapeutic agents? Possible factors that should be taken into account are: 


e Transit time 

e Evidence of bile acid excess 

e Immune activation 

e Biopsy supernatant mediators that activate enteric neurons 

e Mucosal serotonin availability 

Can we assess the role of genetic markers in irritable bowel syndrome? Possible factors 
that should be taken into account are: 

e Gene and environment interactions 

e Biomarker discovery — for example, by genome-wide association studies 

e Pharmacogenetics 

Can we identify the mode of action of food intolerances to allow rational designs of diets? 
Possible tests are: 

e Nutrient challenge meals 

e MRI studies of intestinal volumes and gas or water content of the stool 

Can we characterize the functional effects of changes in microbiota to improve efficacy 
of manipulation of the microbiota as a novel therapy? Possible studies are: 


e Randomized controlled trials of fermentable oligosaccharides, disaccharides, 
monosaccharides and polyols (FODMAP) intervention with assessment of changes 
in microbiota 


e Effect of placebo-controlled diets on faecal or serum bacterial metabolites 


The physical form of food is another key variable 
whose importance is yet to be defined. Many of the diet- 
ary components implicated in IBS symptoms are actu- 
ally consumed as solids and hence delivered into the 
duodenum more slowly after trituration by antral con- 
tractions. The rapid entry of osmotically active poorly 
absorbed substrates — mainly in liquid form — such as 
lactose in a patient with lactose malabsorption” or 
mannitol in healthy volunteers™' result in a rapid influx of 
water into the small intestine, which probably stimulates 


transit and rapid delivery into the colon. This leads to 
the virtually instantaneous generation of gas*’, mainly 
hydrogen, given that the microbiota are unable to fully 
metabolize the sudden excess of substrate. Furthermore, 
distension of the ascending colon generates propul- 
sive colonic motility, which a sensitized individual may 
experience as cramps; a slower delivery in a solid matrix 
may be better tolerated. Future studies should define how 
the physical form of FODMAPs alters their tolerability, 
which would allow a less restrictive diet that may be easier 
to follow and, hence, more widely adopted than at present. 


Functional effect of changes in microbiota 

Many studies have found profound differences in the 
microbiota of selected patients with IBS, but the agree- 
ment on the involved species between studies is poor?’. 
Given the very large number of different species that 
have overlapping metabolic capabilities and functional 
effects, focusing on function may be more helpful than 
just identifying the species present. 

Analysis of urine and stool metabolites, including bile 
acids and endogenous tryptase, may provide simpler bio- 
markers of function that could predict responsiveness to 
microbiota manipulation. Thus, low levels of butyrate, a 
SCFA, might encourage the provision of prebiotics that 
favour butyrate-producing bacteria, such as Eubacterium 
rectale and Roseburia cecicola. Future studies should also 
take into account the important role of transit time and its 
variability. The challenge of rapid transit favours organ- 
isms with either enhanced growth capacity or those that 
adhere to the mucosa to deal with rapid flow within the 
colon’, although, these results need to be replicated and 
studied in more detail to enable dissection of the extent 
to which differences in microbiota are the cause or the 
effect of rapid transit. Better insight might also enable 
the tailoring of diet to the existing microbiota in a patient, 
based on their metabolic capabilities and response to a 
substrate provided in the diet. 
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